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Novel Serine Protease Genes Related to DPPIV 


Field of the Invention 

The present invention relates to novel serine proteases related to dipeptidyl 
peptidase IV (DPPIV), and to isolated nucleic acids coding for these proteases, all of 
which are useful for the discovery of new therapeutic agents, for measuring protease 
5 activity, and for determining the inhibitory activity of compounds against these 
proteases. 

Background of the Invention 

Proteases and peptidases are enzymes that catalyse the hydrolysis of peptidic 
amide bonds Proteases play an important role in the regulation of biological processes 
10 in almost every life-form from bacteria to virus to mammals- They perform critical 
functions in, for example, digestion, blood clotting, apoptosis, activation of immune 
responses, zymogen activation, viral maturation, protein secretion and protein 
trafficking. They can be classified according to a number of criteria, such as site of 
action, substrate preference, and mechanism. So, for example, aminopeptidases act 
15 preferentially at the N-terminal residues of a peptide, while carboxypeptidases act 
preferentially at the C-terminus and endopeptidases act at sites removed from the two 
termini. Among the carboxy- and aminopeptidases, peptidyl peptidases cleave a single 
amino acid residue from the substrate, dipeptidyl peptidases cleave a dipeptide unit (two 
amino acids) from the substrate, and tripeptidases cleave three amino acids from the 
20 substrate. Substrate preference is frequently expressed in terms of the amin o acid 
residue immediately N-terminal to the cleavage site. For example, trypsin-like 
peptidases will preferentially cleave a peptide next to a basic amino acid (arginine or 
lysine), i.e. where the bond hydrolysed is the Arg/Lys-Xaa bond. As another example, 
the chymotiypsin-like family of peptidases preferentially hydrolyse peptides adjacent to 
25 an aromatic residue. Mechanistically, peptidases are classified as being serine- 
dependent, cysteine-dependent, aspartic acid-dependent or zinc-dependent. 

Because peptidases and proteases are involved in the regulation of many 
physiological processes, they are attractive targets for the development of therapeutic 
agents. Protease and peptidase inhibitors are, for example, used in the treatment of 
30 hypertension, coagulation disorders, and viral infection. 

Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, 
being found in viruses, bacteria and eukaryotes. Over 20 families (denoted SI - S27) of 
serine protease have been identified; these are grouped into 6 clans (SA, SB, SC, SE, SF 
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and SG) on the basis of structural similarity and other functional evidence. Structures 
are known for four of the clans (SA, SB, SC and SE); these appear to be totally 
unrelated, suggesting at least four evolutionary origins of serine peptidases and possibly 
many more, Rawlings and Barrett, Meth. Enzvmol. 244: 19-61 (1994). 

5 The prolyl oligopeptidase family consists of a number of evolutionarily related 

peptidases whose catalytic activity seems to be provided by a charge relay system similar 
to that of the trypsin family of serine proteases, but which evolved by independent 
convergent evolution. A conserved serine residue has been shown experimentally (in E. 
coli protease II as well as in pig and bacterial PE) to be necessary for the catalytic 
10 mechanism. This serine, which is part of the catalytic triad (Ser, His, Asp), is generally 
located about 150 residues away from the C-terminal extremity of these enzymes (which 
are all proteins that contains about 700 to 800 amino acids). 

One of the most intensively studied proiyi oligopeptidases is dipeptidyl peptidase 
IV (DPPIV, EC 3.414.5), a type II glycoprotein, which is the only well characterised 
15 dipeptidyl aminopeptidase known to be located on the outer side of plasma membranes. 

As indicated above, dipeptidyl aminopeptidases are characterised by their ability to cleave 
N-terminal dipeptides from a variety of small peptides. Dipeptidyl aminopeptidases show 
different substrate specificities and cellular localisation, suggesting different functions of 
each activity in peptide processing. DPPIV is characterised by its capacity to cleave N- 
20 terminal dipeptidcs containing proline or alanine as the penultimate residue. The DPPIV 
gene spans approximately 70 kb and contains 26 exons, ranging in size from 45 bp to 1.4 
kb. The nucleotide sequence (3,465 bp) of the cDNA contains an open reading frame 
encoding a polypeptide comprising 766 amino acids. The nucleotides that encode the 
active site sequence (G-W-S-Y-G) are split between 2 exons. This clearly distinguishes 
25 the genomic organisation of the prolyl oligopeptidase family from that of the classic serine 
protease family. 

DPPIV is widely distributed in mammalian tissues and is found in great 
abundance in the kidney, intestinal epithelium and placenta (Yaron, A. and Naider, F., 
Critical Reviews in Biochem. Mol. Biol. 1993 [1], 31). In the human immune system, 

30 the enzyme is expressed almost exclusively by activated T-lymphocytes of the CD4 + 
type where the enzyme has been shown to be synonymous with the cell-surface antigen 
CD26. Although the exact role of DP-IV in human physiology is still not completely 
understood, recent research has shown that the enzyme clearly has a major role in human 
physiology and pathophysiology. 
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On human T cells, DPPIV expression appears late in thymic differentiation and is 
preferentially restricted to the CD4 + helper/memory population, and CD26 can deliver a 
potent co-stimulatory T-cell activation signal. DPPIV, also known as T-cell activation 
antigen CD26, therefore plays an important role in the immune response via association 
5 with CD45 tyrosine phosphatase and, through its ability to bind adenosine deaminase 
(ADA) to the T-cell surface, protects the T-cell from adenosine-mediated inhibition of 
proliferation. Furthermore, the regulation of the function of chemokines by 
CD26/DPPIV appears to be essential for lymphocyte trafficking and infectivity of HIV 
strains. DPPIV has been associated with numerous functions including involvement in 
10 T-cell activation, cell adhesion, digestion of proline containing peptides in the kidney 
and intestines, HIV infection and apoptosis, and regulation of tumorigenicity in certain 
melanoma cells, Pethiyagoda et al., Clin. Exp. Metastasis 2000: 1815^:3 91-400. DPPIV 
is also implicated in the endocrine regulation and metabolic physiology. More 
particularly, DPPIV cleaves the amino-terminal His-Ala dipeptide of GLP-1, generating 
15 a GLP-1 receptor antagonist, and thereby shortens the physiological response to GLP-1. 
Glucagon-like peptide-1 (GLP-1), an incretin that induces glucose-dependent insulin 
secretion, is rapidly degraded by DPPIV, and since the half-life for DPPIV cleavage is 
much shorter than the half-life for removal of GLP-1 from circulation, a significant 
increase inGLP-1 bioactivity (5- to 10- fold) is anticipated from DPP-IV inhibition. 

20 Inhibitors of DPPIV are currently being studied in the clinic as potential therapeutic 
agents for type 2 diabetes and impaired glucose tolerance. 

Various different inhibitors of DPPIV were known in 1993. One of these is a 
suicide inhibitor N-Ala-Pro-O-(nitrobenzoyl-) hydroxylamine. Another is a competitive 
inhibitor: e-(4-nitro) benzoxycarbonyl-Lys-Pro, and another is a polyclonal rabbit anti- 
25 porcine kidney DPPIV immunoglobulin. Others have since been developed and are 

described in detail in U.S. Patents Nos. 5,939,560, 6,110,949m 6,011,155 and 5,462,928. 

In addition to, but independent of, its serine type catalytic activity, DPPIV binds 
closely to the soluble extracellular enzyme adenosine deaminase (ADA), acting as a 
receptor and is thought to mediate signal transduction. DPPIV structure is characterized 
30 by two extracellular domains, an a/p fold hydrolase domain and a 7-blade beta-propeller 
domain consisting of repeated beta sheets of about 50 amino acids. Recently it has been 
shown that, besides selecting substrates by size, the beta-propeller domain, containing 10 
of the 12 highly conserved cysteine residues, contributes to catalysis of the peptidase 
domain. In addition, the cysteine-rich domain is responsible for DPPIV-binding to 
35 collagen I and to extracellular ADA. DPPIV is also reported to play a role in fibronectin- 
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mediated interactions of cells with extracellular matrix. Recent studies show that the 
protease activity of DPPIV is not required for its anti-invasive activity because mutants 
of DPPIV that lack the extracellular serine protease activity maintain such activity. 

A number of proteins that share similarities with DPPIV have been reported in 
5 the literature. Several of these proteins have been cloned including DPP-1, DPP-II, 
DPP-in, DPP-X and fibroblast activation protein (FAP). These have been identified and 
characterised either by molecular cloning and functional studies of expressed proteins or 
as biochemical activities in tissue extracts. DPPIV-beta and other novel peptidases with 
functional similarities to DPPIV are not yet cloned. The identification, characterization 
10 and/or appropriate classification of further members of the family of prolyl 
oligopeptidases, the elucidation of their physiological (and particularly 
pathophysiological) role, and the application of that knowledge to the development of 
new therapeutic agents are significant challenges. 

Summary of the Invention 

15 The present invention provides proteins with prolyloligopeptidase (post-proline 

cleaving) activities that constitute three novel members of a family of proteins related to 
DPPIV, including the full-length proteins, alternative splice forms, subunits, and 
mutants, as well as nucleotide sequences encoding the same. The present invention also 
provides methods of screening for substrates, interacting proteins, agonists, antagonists 
20 or inhibitors of the above proteins, and furthermore to pharmaceutical compositions 

comprising the proteins and/or mutants, derivatives and/or analogues thereof and/or 
ligands thereto. 

These novel proteins having significant sequence homology to DPPIV are termed 
dipeptidyl peptidase IV-related protein-1, 2 & 3 (DPRP-1, DPRP-2 and DPRP-3). The 
25 amino acid sequences of DPRP-1, DPRP-2 and DPRP-3 are given in SEQ. ID NOS: 1, 3 
and 5 respectively. Further disclosed are nucleic acid sequences coding for these 
proteins (SEQ. ID NOS:2, 4 and 6). Table 1 illustrates the homology (i.e. similarity) 
between the novel proteins DPRP-1, DPRP-2 and DPRP-3 and other known serine 
proteases. 


- 4 - 


WO 02/31134 


PCT/US01/31874 


Table 1 - Comparison of the sequences of these three novel proteins with DPPIV 
and other Clan SC, Family S9 members and Subfamily B members 


Protease 

Family 

Protease 

name 

No. of a.a. 

Homology 
with DPPIV 

TM 

region 

Ser-Asp-His 

Triad 

Gene location 

I— 

■H 

Clan CA, 
Family Cl 

DPPI 

463 

N 

N 

N 

1 Iql4.1-ql4.3 

- 

Clan SC, 

Family S28 

DPPII 

500 

N 

Y 

N 

- 

4.5-6.0 

QPP 

492 

N 

N 

N 

- 

4.5-7.5 

P CP 

496 

N 

N 

N 

- 

- 

Unassigned 

DPPin 

737 

N 

N 

N 

- 

- 

Clan SC, 

Family 59, 
Subfamily B 

_ 

Dpprv 

766 

100 

Y 

Y 

2q24.3 

7.5-8.0 

DPPVI 

865 

52 

Y 

Mutation 

7 

- 

FA? 

760 

70 

Y 

Y 

2q23 

7.5-8.0 

DPRP-l 

882 

41 

N 

Y 

15q22.1-15q22.2 

7.5-8.0 

DPRP-2 

864 

39 ~ 

N 

Y 

19p 13.3 

7.5-8.0 

DPRP-3 

796 

54 

Y 

Mutation 

2q12.3-2q14.1 



The greatest homology between DPRP-1, DPRP-2 and DPPIV is seen in the C- 
terminal sequences. On the basis of sequence homology with DPPIV (see Figure 1), one 
15 might predict that these DPRP proteins would have functions that include, but are not 
limited to, roles as enzymes. Cloning, expression, biochemical and molecular 
characterization have confirmed this hypothesis. 

The expression pattern of DPRPs and the localization to specialized epithelial 
cells and plasma cells (Leydig cells, prostate epithelial cells, lymphocytes, B cells) is 
20 consistent with a role in differentiation, proliferation and inflammation. The localization 
of the DPRP-1 gene in hormone sensitive cancers (breast, prostate, testicular), tissues 
regulated by testosterone and the abundant expression in poorly differentiated cancers, 
demonstrate that DPRP-activating or inhibiting molecules will have numerous 
therapeutic applications in the treatment of disorders characterized by deregulated 
25 growth, differentiation and steroid or polypeptide hormone synthesis and degradation. 
Data disclosed herein supports the hypothesis that DPRP-1 and DPRP-2 are involved in 
the regulation of proliferation of in vitro models of prostate and testis cancer well known 
to those skilled in the art. 

DPRP-1 and DPRP-2 activities described herein and their expression patterns are 
30 compatible with their having functional roles as physiological regulators of the immune 
and neuroendocrine systems through the enzymatic modification of biochemical 
mediators like peptides and chemokines. The numerous functions previously described 
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for DPPIV based upon the use of inhibitors may be due in part to its action and that of 
similar proteins, like the DPRPs. Therefore, the discovery of selective and potent 
inhibitors of DPPIV, of the DPRPs and of other related proteases like FAP is considered 
central to achieving effective and safe pharmaceutical use of these and any newly 
5 identified serine protease inhibitors, as well as other active compounds that modify the 
function(s) of such proteins. 

The invention thus provides novel proteins or polypeptides, the nucleic acids 
coding therefor, cells which have been modified with the nucleic acid so as to express 
these proteins, antibodies to these proteins, a screening method for the discovery of new 
10 therapeutic agents which are inhibitors of the activity of these proteins (or which are 
inhibitors of DPPIV and not of the proteins), and therapeutic agents discovered by such 
screening methods. The novel proteins and the nucleic acids coding therefor can be used 
to discover new therapeutic agents for the treatment of certain diseases, such as for 
example, reproductive, inflammatory and metabolic disorders and also in the preparation 
15 of antibodies with therapeutic or diagnostic value. 

In accordance with one aspect of the present invention, there are provided novel, 
mature, biologically active proteins, principally of human origin. Such proteins may be 
isolated in small quantities from suitable animal (including human) tissue or biological 
fluids by standard techniques; however, larger quantities are more conveniently prepared 
20 in cultures of cells genetically modified so as to express the protein. 

In accordance with another aspect of the present invention, there are provided 
isolated nucleic acid molecules encoding polypeptides of the present invention including 
mRNAs, DNAs, cDNAs, genomic DNAs thereof. 

In accordance with a further aspect of the present invention, nucleic acid probes 
25 are also provided comprising nucleic acid molecules of sufficient length to specifically 
hybridize to a nucleic acid sequence of the present invention. 

In accordance with a still further aspect of the present invention, processes 
utilizing recombinant techniques are provided for producing such polypeptides useful for 
in vitro scientific research, for example, synthesis of DNA and manufacture of DNA 
30 vectors. Processes for producing such polypeptides include culturing recombinant 
prokaryotic and/or eukaryotic host cells that have been transfected with DNA vectors 
containing a nucleic acid sequence encoding such a polypeptide and/or the mature 
protein under conditions promoting expression of such protein and subsequent recovery 
of such protein or a fragment of the expressed product. 
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In accordance with still another aspect, the invention provides methods for using 
DPRP polypeptides and polynucleotides, including the treatment of infections, such as 
bacterial, fungal, protozoan and viral infections, particularly infections caused by HIV-1 
or HIV-2, pain, diabetes, precocious puberty, infertility, obesity, anorexia, bulimia, 

5 Parkinson's disease, acute heart failure, hypotension, hypertension, urinary retention, 
osteoporosis, angina pectoris, myocardial infarction, stroke, ulcers, asthma, allergies, 
benign prostatic hypertrophy, cancers including hormone-sensitive and androgen- 
independent cancers, migraines, vomiting, psychotic and neurological disorders, 
including anxiety, schizophrenia, manic depression, depression, dementia, and severe 
10 mental retardation, and dyskinesias, hereinafter collectively referred to as "the Diseases". 

In accordance with yet another aspect of the present invention, there is provided a 
process for utilizing such polypeptides, or polynucleotides encoding such polypeptides, 
for the discovery of compounds that inhibit the biological activity of the mature proteins 

thereof, e.g. by cleaving an N-terminal dipeptide, and such inhibitors are thus also 
15 provided. 

In accordance with a more specific aspect, the invention provides iolated nucleic 
acid which encodes (a) a polypeptide which includes the amino acid sequence of one of 
SEQ ID NOS: 1, 3 and 5, or (b) a polypeptide having an amino acid sequence that is at 
least about 70% similar thereto and exhibits the same biological function, or which is an 
20 alternative splice variant of one of SEQ ID NOS:2,4 and 6, or which is a probe 

comprising at least 14 contiguous nucleotides from said nucleic acid encoding (a) or (b), 
or which is complementary to any one of the foregoing. 

In accordance with another specific aspect, the invention provides a polypeptide 
which may be optionally glycosylated, and which (a) has the amino acid sequence of a 
25 mature protein set forth in any one of SEQ ID NOS: 1, 3 and 5; (b) has the amino acid 
sequence of a mature protein having at least about 70% similarity to one of the mature 
proteins of (a) and which exhibits the same biological function; (c) has the amino acid 
sequence of a mature protein having at least about 90% identity with a mature protein of 
any of SEQ ID NOS: 1, 3 and 5; or (d) is an immunologically reactive fragment of (a). 

30 In accordance with still another specific aspect, the invention provides a method 

for the screening for a compound capable of inhibiting the enzymatic activity of at least 
one mature protein of the invention, which method comprises incubating said mature 
protein and a suitable substrate for said mature protein in the presence of one or more 
test compounds or salts thereof, measuring the enzymatic activity of said mature protein, 
35 comparing said activity with comparable activity determined in the absence of a test 
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compound, and selecting the test compound or compounds that reduce the enzymatic 
activity, and it also provides a method for screening for a compound capable of 
inhibiting the enzymatic activity of DPPIV that does not inhibit the enzymatic activity of 
at least one mature protein and a suitable substrate in the presence of one or more 
5 inhibitors of DPPIV or salts thereof, measuring the enzymatic activity of said mature 
protein, comparing said activity with comparable activity determined in the absence of 
the DPPIV inhibitor, and selecting a compound that does not reduce the enzymatic 
activity of said mature protein. 

These and other aspects of the present invention should be apparent to those 
10 skilled in the art from the detailed description which follows. 

Brief Description of the Drawings 

FIGS. 1A and IB show the co-linear alignment of DPRP-1, DPRP-2, DPRP-3 
and DPPIV, with shading being supplied to indicate the same (black) or similar (gray) 
amino acid residues at a particular location. 

15 FIG. 2 is similar to FIG. 1 and shows co-linear alignment of human and mouse 

DPRP-2. 

FIG. 3 is a graph which shows the effects of various tetrapeptide amide inhibitors 
on dipeptidyl peptidase enzyme activity. 

FIGS. 4A-4C show the effects of three inhibitor compounds on the proliferation 
20 of PC3 prostate cancer cell lines at various doses. 

Detailed Description of the Preferred Embodiments 

In accordance with an aspect of the present invention, there are provided isolated 
nucleic acid sequences (polynucleotides), which encode the mature polypeptides having 
the deduced amino acid sequences of the three DPRP’s (SEQ ID NOS:l, 3 and 5). 

25 The polynucleotides of this invention were discovered using a human testis 

cDNA library (DPRP-1), a human colon library (DPRP-2) and a human hypothalamus 
cDNA library (DPRP-3). Isolated nucleic acid for DPRP-1 contains an open reading 
frame encoding a protein of approximately 882 amino acids in length which is 
structurally related to human DPPIV, showing 26% identity, and 41% similarity over the 
30 entire human DPPIV protein sequence. Isolated nucleic acid for DPRP-2 contains an 
open reading frame encoding for a protein of approximately 864 amino acids, which is 
39% similar to the entire DPPIV amino acid sequence. Analysis of DPRP-1 and DPRP- 
2 primary amino acid sequence using hydrophobicity plots predicts that these two 
proteins do not have a transmembrane domain. Despite this fact, it is possible that these 
35 intracellular serine proteases are secreted upon cellular activation. Quiescent cell proline 
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dipeptidase (QPP) is a serine protease that is targeted to intracellular vesicles that are 
distinct from lysosomes (Chiravuri M, et al., J. Immunol. 2000 Nov 15;165(10):5695- 
702 ). This hypothesis expands the potential site(s) and scope of DPRP-1 and DPRP-2 
involvement in mechanisms for post-translational regulation of chemokines, cytokines, 
5 peptides and polypeptides. The full length DPRP-3 sequence contains 796 amino acids, 
a signal peptide from 1 to 48, and a transmembrane domain between 34 and 56. The 
mature protein is predicted to be a type II membrane protein and may be cleaved to 
produce a soluble form. The amino acid sequence is set forth in SEQ ID NO:5 , which 
was deduced from SEQ ID NO:6 and has 54% similarity with DPPIV. 

10 Amino acid sequence alignments of these polypeptides with members of the 

prolyloligopeptidase enzyme subfamily S9B show that all three DPRP proteins have 
overall sequence and structural homology to DPPIV and FAP. DPRPs are predicted to 
be a members of the enzyme Clan SC (Serine nucleophile) with catalytic residues in the 
order Ser, Asp, His and the active site sequence (G-W-S-Y-G). 

15 Table 2. Homology (i.e. similarity) between DPRP-1, DPRP-2, DPRP-3 and members 
of the prolyl oligopeptidase family S9B enzymes. 

DPPIV 


41 

DPRP-1 




39 

74 

DPRP-2 



54 

39 

40 

DPRP-3 


70 

41 

39 

52 

FAP 

52 

40 

42 

68 

54 

DPPVI 


DPRP-1, DPRP-2 and DPRP-3 do not exhibit sequence similarity with any 
members of the classical serine protease families, chymotrypsin and subtilisin. The order 
20 of the catalytic triad residues is different in the three main related SC clan families: His- 
Asp-Ser in chymotrypsin, Asp-His-Ser in subtilisin and Ser-Asp-His in the prolyl 
oligopeptidases. 

As shown in Table 2, DPRP-3 has the highest homology with DPPVI (68% 
homology and 51% identity). Wada et al isolated cDNA clones for DPPVI, a DPPIV- 
25 related protein, from bovine, rat (Wada et al., Proc. Nat. Acad. Sci. 89 : 197-201. (1992)) 
and human (Yokotani et al., Hum. Molec. Genet. 2 :1037-1039 (1993)) brain libraries. 
They demonstrated that, unlike DPPIV, the catalytic triad in DPPVI does not have the 
first serine residue. In DPRP-3 two of the amino acids in the catalytic triad 
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characteristic of the serine protease family are conserved. However, the serine residue 
itself is replaced by glycine. While the absence of the serine residue is likely to prevent 
protease activity at this site, it is possible that multiple other functions mediated by other 
functional domains of the protein remain intact. 

5 As briefly described above, DPPIV is a multifunctional molecule that exerts 

important functions depending on the expressed cells and tissues, in addition to its 
catalytic activity as a peptidase. DPRP-3 and DPPVI are also likely to maintain multiple 
functions despite the absence of an intact catalytic triad. For example, DPPVI has been 
implicated in the regulation of neuronal plasticity. DPPVI is highly expressed in the 
10 hippocampus, thalamus, hypothalamus and stiatum. In addition, developmental arrest 
and embryonic lethality of rump white Rw/Rw embryos is thought to be due to disruption 
of the DPPIV gene. Rw mutation is associated with a chromosomal inversion spanning 
30 cM of the proximal portion of mouse chromosome 5. Genomic analysis of the DPPVI 
gene on the Rw chromosome places the inversion breakpoint in the coding region 
15 resulting in loss of a significant fraction of the C-terminal region. Hough R.B. et al., 
Proc. Nat. Acad. Sci.. 95. 13800-13805 (1998). 

The human DPRP-1 gene, predicted to be 32668bp in length, has at least 22 
exons and eight transcripts. It maps to chromosome 15 (NT_010265) at position 
15q21.1 - 15q22.1. The lengths of predicted alternative splice variant transcripts vary 
20 between 602bp and 4523bp (see SEQ ID NOS: 7-22). This is in agreement with the 
multiple transcripts observed by Northern blot analysis (See Example 2). ESTs 
representing the transcripts were found in numerous tissues including senescent 
fibroblasts, T-lymphocytes, germinal center B-cells, germ cell seminoma, testis, 
melanocytes, uterus, ovary breast, multiple sclerosis lesions, pancreas and placenta. 

25 Human DPRP-2 belongs to a gene with at least 27 exons and nine splice variants 

(see SEQ ID NOS.23-40). One SNP was observed in the 3' UTR. (88% (37) C vs. 12% 
(5) T). The DPRP-2 gene maps to region 19pl3.3 of chromosome 19. This location is 
host to a number of disease markers and is associated with various disorders including 
hypocalciuric hypercalcemia, type II cerebellar ataxia, muscular dystrophy, convulsions, 
30 susceptibility to atherosclerosis, psoriasis, ectodermal dysplasia, and acute myeloid 
leukemia. In agreement with the ubiquitous distribution of the mRNA observed by 
Northern blot analysis (see Example 2), DPRP-2 was expressed in a wide variety of 
tissues upon examination of EST’s coverage (e.g. over 64 EST’s expressed in liver, 
spleen, muscle, melanocytes, heart, lung, placenta, skin, pancreas, stomach, brain 
35 parathyroid gland). 
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Human DPRP-3 belongs to a gene with at least 23 exons and two splice variants 
(see SEQ ED NOS:4I-44). The gene maps to chromosome 2 (NT 005445) at position 
2ql2.3-2ql4.1. Transcripts for DPRP-3 did not show as wide a distribution as DPRP-1 
and DPRP-2. As shown by Northern blot in Example 2, DPRP-3 expression is restricted 
5 to brain and pancreas. ESTs representing the DPRP-3 mRNA were abundant in tissue 
derived from multiple sclerosis lesions, hypothalamus, whole brain and nerves, with a 
few transcripts being found in uterus and colon. 

The relationships among human and rodent proteases in clan SC, including 
DPRP-1 DPRP-2 and DPRP-3, were analyzed using Neighbor Joining method (NJ), see 
10 Saitou and Nei, Mol. Biol. EvoL 4. 406-525 (1987). Phylogenetic analysis shows that 
among the S9 proteases, DPRP-1 and DPRP-2, both lacking a transmembrane domain, 
are distinguished from DPPIV and its closely related proteins like FAP. Similarity is 
shown however between DPPIV and FAP and between DPRP-3 and DPPVI, which are 
all type II membrane proteins. 

15 A database search for additional DPRP-related genes revealed the presence of a 

murine sequence related to DPRP-1. Alignment of this mouse sequence with the novel 
human proteases shows that the mDPRP-1 displays considerable homology with its 
human counterpart (FIG. 2). One skilled in the art will readily recognize that the novel 
mouse protease gene can be isolated using the sequence information disclosed herein and 
20 can be readily incorporated into one of the routinely used expression constructs which 
are well known in the art. Use of this disclosed sequence by those skilled in the art to 
generate a transgenic mouse model will employ development of gene-targeting vectors, 
for example, that result in homologous recombination in mouse embryonic stem cells. 

The use of knockout mice in further analysis of the function of DPRP genes is a valuable 
25 tool. 

The polynucleotides of the present invention may be in the form of RNA or in 
the form of DNA; DNA should be understood to include cDNA, genomic DNA, and 
synthetic DNA. The DNA may be double-stranded or single-stranded and, if single- 
stranded, may be the coding strand or non-coding (antisense) strand. The coding 
30 sequence which encodes the mature polypeptide may be identical to the coding sequence 
shown in SEQ ID NOS:2, 4 and 6 respectively, or it may be a different coding sequence 
encoding the same mature polypeptide, as a result of the redundancy or degeneracy of 
the genetic code or a single nucleotide polymorphism. For example, it may also be an 
RNA transcript which includes the entire length of any one of SEQ ID NOS:2, 4 and 6. 


WO 02/31134 


PCT/US01/31874 


The polynucleotides which encode the mature proteins of SEQ ID NOS:l, 3, 5, 
respectively, may include but are not limited to the coding sequence for the mature 
protein alone; the coding sequence for the mature polypeptide plus additional coding 
sequence, such as a leader or secretory sequence or a proprotein sequence; and the 
5 coding sequence for the mature protein (and optionally additional coding sequence) plus 
non-coding sequence, such as introns or a non-coding sequence 5' and/or 3' of the coding 
sequence for the mature protein. 

Thus, the term "polynucleotide encoding a polypeptide" or the term "nucleic acid 
encoding a polypeptide" should be understood to encompass a polynucleotide or nucleic 
10 acid which includes only coding sequence for the mature protein as well as one which 
includes additional coding and/or non-coding sequence. The terms polynucleotides and 
nucleic acid are used interchangeably. 

The present invention also includes polynucleotides where the coding sequence 
for the mature protein may be fused in the same reading frame to a polynucleotide 
15 sequence which aids in expression and secretion of a polypeptide from a host cell; for 
example, a leader sequence which functions as a secretory sequence for controlling 
transport of a polypeptide from the cell may be so fused. The polypeptide having such a 
leader sequence is termed a preprotein or a preproprotein and may have the leader 
sequence cleaved, by the host cell to form the mature form of the protein. These 
20 polynucleotides may have a 5' extended region so that it encodes a proprotein, which is 
the mature protein plus additional amino acid residues at the N-terminus. The 
expression product having such a prosequence is termed a proprotein, which is an 
inactive form of the mature protein; however, once the prosequence is cleaved an active 
mature protein remains. Thus, for example, the polynucleotides of the present invention 
25 may encode mature proteins, or proteins having a prosequence, or proteins having both a 
prosequence and a presequence (leader sequence). 

The polynucleotides of the present invention may also have the coding sequence 
fused in frame to a marker sequence which allows for purification of the polypeptides of 
the present invention. The marker sequence may be a polyhistidine tag, a hemagglutinin 
30 (HA) tag, a c-myc tag or a V5 tag when a mammalian host, e.g. COS-1 cells, is used. 

The HA tag would correspond to an epitope derived from the influenza hemagglutinin 
protein (Wilson, I., et al.. Cell. 37 :767 (1984)), and the c-myc tag may be an eptitope 
from human Myc protein (Evans, G.I. et al.. Mol. Cell. Biol. 5: 3610-3616 (1985)). 

The term "gene" means the segment of DNA involved in producing a polypeptide 
35 chain; it includes regions preceding and following the coding region (leader and trailer) 
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as well as intervening sequences (introns) between individual coding segments (exons). 
The term “significant sequence homology” is intended to denote that at least 25%, 
preferably at least 40%, of the amino acid residues are conserved, and that, of the non- 
conserved residues, at least 40% are conservative substitutions. 

5 Fragments of the full-length genes of the present invention may be used as a 

hybridization probe for a cDNA library to isolate full-length cDNA as well as to isolate 
other cDNAs which have significant sequence homology to the gene and will encode 
proteins or polypeptides having similar biological activity or function. By similar 
biological activity or function, for purposes of this application, is meant the ability to 
10 cleave an N-terminal dipeptide having Ala or Pro as the penultimate residue or other 
amino acids. Such a probe of this type has at least 14 bases (at least 14 contiguous 
nucleotides from one of SEQ ID NOS:2, 4 or 6), preferably at least 30 bases, and such 
may contain, for example, 50 or more bases. Such probe may also be used to identify a 
cDNA clone corresponding to a full-length transcript and/or a genomic clone or clones 
15 that contains the complete gene, including regulatory and promoter regions, exons, and 
introns. Labelled oligonucleotides having a sequence complementary to that of the gene 
of the present invention are useful to screen a library of human cDNA, genomic DNA or 
mRNA to locate members of the library to which the probe hybridizes. As an example, a 
known DNA sequence may be used to synthesize an oligonucleotide probe which is then 
20 used in screening a library to isolate the coding region of a gene of interest. 

The present invention is considered to further provide polynucleotides which 
hybridize to the hereinabove-described sequences wherein there is at least 70%, 
preferably at least 90%, and more preferably at least 95% identity or similarity between 
the sequences, and thus encode proteins having similar biological activity. Moreover, as 
25 known in the art, there is “similarity” between two polypeptides when the amino acid 
sequences contain the same or conserved amino acid substitutes for each individual 
residue in the sequence. Identity and similarity may be measured using sequence 
analysis software (e.g., Sequence Analysis Software Package of the Genetics Computer 
Group, University of Wisconsin Biotechnology Center, 1710 University Avenue, 

30 Madison, WI 53705). The present invention particularly provides such polynucleotides 
which hybridize under stringent conditions to the hereinabove-described 
polynucleotides. As herein used, the term "stringent conditions" means conditions which 
permit hybridization between polynucleotides sequences and the polynucleotide 
sequences of SEQ ID NOS:2, 4 and 6 where there is at least about 70% identity. 

35 Suitably stringent conditions can be defined by, e.g., the concentrations of salt or 
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formamide in the prehybridization and hybridization solutions, or by the hybridization 
temperature, and are well known in the art. In particular, stringency can be increased by 
reducing the concentration of salt, by increasing the concentration of formamide, and/or 
by raising the hybridization temperature. 

5 For example, hybridization under high stringency conditions may employ about 

50% formamide at about 37°C to 42°C, whereas hybridization under reduced stringency 
conditions might employ about 35% to 25% formamide at about 30°C to 35°C. One 
particular set of conditions for hybridization under high stringency conditions employs 
42°C, 50% formamide, 5x. SSPE, 0.3% SDS, and 200 pg/ml sheared and denatured 
10 salmon sperm DNA. For hybridization under reduced stringency, similar conditions as 
described above may be used in 35% formamide at a reduced temperature of 35°C. The 
temperature range corresponding to a particular level of stringency can be further 
narrowed by calculating the purine to pyrimidine ratio of the nucleic acid of interest and 
adjusting the temperature accordingly. Variations on the above ranges and conditions 
15 are well known in the art. Preferably, hybridization should occur only if there is at least 
95%, and more preferably at least 97%, identity between the sequences. The 
polynucleotides which hybridize to the hereinabove described polynucleotides in a 
preferred embodiment encode polypeptides which exhibit substantially the same 
biological function or activity as the mature protein encoded by one of the cDNAs of 
20 SEQ ID NOS:2,4 and 6. 

As mentioned, a suitable polynucleotide probe may have at least 14 bases, 
preferably 30 bases, and more preferably at least 50 bases, and will hybridize to a 
polynucleotide of the present invention which has an identity thereto, as hereinabove 
described, and which may or may not retain activity. For example, such polynucleotides 
25 may be employed as a probe for hybridizing to the polynucleotides of SEQ ID NOS:2, 4 
and 6 respectively, for example, for recovery of such a polynucleotide, or as a diagnostic 
probe, or as a PCR primer. Thus, the present invention includes polynucleotides having 
at least a 70% identity, preferably at least a 90% identity, and more preferably at least a 
95% identity to a polynucleotide which encodes the polypeptides of SEQ ID NOS:l, 3 
30 and 5 respectively, as well as fragments thereof, which fragments preferably have at 
least 30 bases and more preferably at least 50 bases, and to polypeptides encoded by 
such polynucleotides. 

As is well known in the art, the genetic code is redundant in that certain amino 
acids are coded for by more than one nucleotide triplet (codon), and the invention 
35 includes those polynucleotide sequences which encode the same amino acids using a 
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different codon from that specifically exemplified in the sequences herein. Such a 
polynucleotide sequence is referred to herein as an "equivalent" polynucleotide 
sequence. The present invention further includes variants of the hereinabove described 
polynucleotides which encode for fragments, such as part or all of the mature protein, 

5 analogs and derivatives of one of the polypeptides having the deduced amino acid 

sequence of SEQ ID NOS:l, 3 and 5 respectively. The variant forms of the 
polynucleotides may be a naturally occurring allelic variant of the polynucleotides or a 
non-naturally occurring variant of the polynucleotides. For example, the variant in the 
nucleic acid may simply be a difference in codon sequence for the amino acid resulting 
10 from the degeneracy of .the genetic code, or there may be deletion variants, substitution 
variants and addition or insertion variants. As known in the art, an allelic variant is an 
alternative form of a polynucleotide sequence which may have a substitution, deletion or 
addition of one or more nucleotides that does not substantially alter the biological 
function of the encoded polypeptide. 

15 The present invention further includes polypeptides which have the deduced 

amino acid sequence of SEQ ID NOS:l, 3 and 5, as well as fragments, analogs and 
derivatives of such polypeptides. The terms "fragment," "derivative" and "analog", 
when referring to the polypeptides of SEQ ID NOS; 1,3 and 5, means polypeptides that 
retain essentially the same biological function or activity as such polypeptides. An 
20 analog might, for example, include a proprotein which can be activated by cleavage of 
the proprotein portion to produce an active mature protein. The polypeptides of the 
present invention may be recombinant polypeptides, natural polypeptides or synthetic 

polypeptide; however, they are preferably recombinant polypeptides, glycosylated or 
unglycosylated. 

25 The fragment, derivative or analog of a polypeptide of SEQ ID NOS: 1,3 and 5 

respectively, may be (i) one in which one or more of the amino acid residues is 
substituted with a conserved or non-conserved amino acid residue (preferably a 
conserved amino acid residue) and such substituted amino acid residue may or may not 
be one encoded by the genetic code, or (ii) one in which one or more of the amino acid 
30 residues includes a substituent group, or (iii) one in which additional amino acids are 
fused to the mature protein, such as a leader or secretory sequence or a sequence which 
is employed for purification of the mature polypeptide or a proprotein sequence. Such 
fragments, derivatives and analogs are deemed to be within the scope of those skilled in 
the art to provide upon the basis of the teachings herein. 
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The polypeptides and polynucleotides of the present invention should be in an 
isolated form, and preferably they are purified to substantial homogeneity or purity. By 
substantial homogeneity is meant a purity of at least about 85%. 

The term "isolated" is used to mean that the material has been removed from its 
original environment (e.g., the natural environment if it is naturally occurring). For 
example, a naturally occurring polynucleotide or polypeptide present in a living animal 
is not considered to be isolated, but the same polynucleotide or polypeptide, when 
separated from substantially all of the coexisting materials in the natural system, is 
considered isolated. For DNA, the term includes, for example, a recombinant DNA 
which is incorporated into a vector, into an autonomously replicating plasmid or virus, or 
into the genomic DNA of a prokaryote or eukaryote; or which exists as a separate 
molecule (e.g., a cDNA or a genomic or cDNA fragment produced by polymerase chain 
reaction (PCR) or restriction endonuclease digestion) independent of other sequences. it 
also includes a recombinant DNA which is part of a hybrid gene encoding additional 
polypeptide sequence, e.g., a fusion protein. Further included is recombinant DNA 
which includes a portion of the nucleotides shown in one of SEQ ID NO:2,4 or 6 which 
encodes an alternative splice variant of the DPRP. Various alternative splice variants are 
exemplified in SEQ ID NOS:8,10,12, 14, 16,18,20,22, 24, 26, 28, 30,32, 34, 36, 38, 


40, 42, 44 and 46. 

The polypeptides of the present invention include any one of the polypeptide of 
SEQ ID NOS:l, 3 and 5 (in particular the mature protein), as well as polypeptides which 
have at least 70% similarity (e.g. preferably at least 60% and more preferably at least 
70% identity) to one of the polypeptides of SEQ ID NOS: 1,3 and 5, more preferably at 
least 90% similarity (e.g. preferably at least 90% identity) to one of the polypeptides of 
SEQ ID NOS:l, 3 and 5, and most preferably at least 95% similarity (e.g. preferably at 
least 95% identity) to one of the polypeptides of SEQ ED NOS: 1, 3 and 5. Moreover, 
they should preferably include exact portions of such polypeptides containing a sequence 
of at least 30 amino acids, and more preferably at least 50 amino acids. 

Fragments or portions of the polypeptides of the present invention may be 
employed as intermediates for producing the corresponding full-length polypeptides by 
peptide synthesis. Fragments or portions of the polynucleotides of the present invention 
may also be used to synthesize full-length polynucleotides of the present invention. 

The present invention also includes vectors which include such polynucleotides, 
host cells which are genetically engineered with such vectors and the production of 
polypeptides by recombinant techniques using the foregoing. Host cells are genetically 
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engineered (transduced or transformed or transfected) with such vectors which may be, 
for example, a cloning vector or an expression vector. The vector may be, for example, 
in the form of a plasmid, a viral particle, a phage, etc. The engineered host cells can be 
cultured in conventional nutrient media modified as appropriate for activating promoters, 
5 selecting transformants or amplifying the genes of the present invention. The culture 
conditions, such as temperature, pH and the like, are those commonly used with the host 
cell selected for expression, as well known to the ordinarily skilled artisan. 

The polynucleotides of the present invention may be employed for producing 
polypeptides by recombinant techniques. Thus, for example, the polynucleotides may be 
10 included in any one of a variety of expression vectors for expressing polypeptides. Such 
vectors include chromosomal, nonchromosomal and synthetic DNA sequences, e.g., 
derivatives of SV40; bacterial plasmids; phage DNA; baculovirus; yeast plasmids; 
vectors derived from combinations of plasmids and phage DNA, viral DNA such as 
vaccinia, adenovirus, fowl pox virus, and pseudorabies. However, any other vector may 
15 be used as long as it is replicable and viable in the host. 

The appropriate DNA sequence may be inserted into the vector by any of a 
variety of procedures. In general, the DNA sequence is inserted into an appropriate 
restriction endonuclease site(s) by procedures well known in the art, which procedures 
are deemed to be within the scope of those skilled in this art. 

20 The DNA sequence in the expression vector is operatively linked to an 

appropriate expression control sequence(s) (promoter) to direct mRNA synthesis. As 
representative examples of such promoters, there may be mentioned: LTR or SV40 
promoter, the E. coli. lac or tip, the phage lambda P.sub.L promoter and other promoters 
known to control expression of genes in prokaryotic or eukaryotic cells or their viruses. 

25 The expression vector should also contain a ribosome binding site for translation 
initiation and a transcription terminator. The vector may also include appropriate 
sequences for amplifying expression. In addition, the expression vectors preferably 
contain one or more selectable marker genes to provide a phenotypic trait for selection 
of transformed host cells, such as dihydrofolate reductase or neomycin-resistance for 
30 eukaryotic cell culture, or such as tetracycline- or ampicillin-resistance in E. coli. 

The vector containing the appropriate DNA sequence as hereinabove described, 
as well as an appropriate promoter or control sequence, may be employed to transform 
an appropriate host to permit the host to express the protein. As representative examples 
of appropriate hosts, there may be mentioned: bacterial cells, such as E. coli , 

35 Streptomyces, Salmonella typhimurium; fungal cells, such as yeast; insect cells, such as 
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Drosophila S2 and Spodoptera Sf9; animal cells, such as CHO, COS or Bowes 
melanoma; adenoviruses; plant cells, etc. The selection of an appropriate host is deemed 
to be within the scope of those skilled in the art from the teachings herein. 

Synthetic production of nucleic acid sequences is well known in the art as is 
apparent from CLONTECH 95/96 Catalogue, pages 215-216, CLONTECH, 1020 East 
Meadow Circle, Palo Alto, Calif. 94303. Thus, the present invention also includes 
expression vectors useful for the production of the proteins of the present invention 

The present invention further includes recombinant constructs comprising one or 
more of the sequences as broadly described above. The constructs may comprise a 
vector, such as a plasmid or viral vector, into which a sequence of the invention has been 
inserted, in a forward or reverse orientation. In a preferred aspect of this embodiment, 
the construct further comprises regulatory sequences, including, for example, a 
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promoters are known to those of skill in the art, and are commercially available. The 
following vectors are provided by way of example: Bacterial: pQE70, pQE60, pQE-9 
(Qiagen), pBS, pDIO, phagescript, psiX174, pbluescript SK, pbsks, pNH8A, pNH16a, 
pNHISA, pNH46A (Stratagene), ptrc99a, pKK223-3, pKK233-3, pDR540 and pRIT5 
(Pharmacia); and Eukaryotic: pWLNEO, pSV2CAT, pOG44, pXTl, pSG (Stratagene) 
pSVK3, pBPV, pMSG, and pSVL (Pharmacia). However, any other suitable plasmid or 
vector may be used as long as it is replicable and viable in the host. 

Promoter regions can be selected from any desired gene using CAT 
(chloramphenicol acetyl transferase) vectors or other vectors with selectable markers. 
Two appropriate vectors are pKK232-8 and pCM7. Particular named bacterial 
promoters include lacl, lacZ, T3, T7, gpt, lambda P.sub.R, P.sub.L and trp. Eukaryotic 
promoters include CMV immediate early, HSV thymidine kinase, early and late SV40, 
LTRs from retrovirus, and mouse metallothionein-I. Selection of the appropriate vector 
and promoter is well within the level of ordinary skill in the art. 

Components of the expression vector may generally include: 1) a neomycin 
phosphotransferase (G418), or hygromycin B phosphotransferase (hyg) gene as a 
selection marker, 2) an E. coli origin of replication, 3) a T7 and SP6 phage promoter 
sequence, 4) lac operator sequences, 5) the lactose operon repressor gene (laclq) and 6) a 
multiple cloning site linker region. Such an origin of replication (oriC) may be derived 
from pUC19 (LTI, Gaithersburg, Md.). 

A nucleotide sequence encoding one of the polypeptides SEQ ID NOS:2,4 and 6 
having the appropriate restriction sites is generated, for example, according to the PCR 
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protocol described in Example 1 hereinafter, using PCR primers having restriction sites 
for Kpnl (as the 5' primer) and Nod or SacI (as the 3’ primer) for DPRP-1, or sites for 
Hindlll (as the 5’ primer) and Notl or BamHI (as the 3’ primer) for DPRP-2. The PCR 
inserts are gel-purified and digested with compatible restriction enzymes. The insert and 
5 vector are ligated according to standard protocols. 

In a further embodiment, the present invention provides host cells containing the 
above-described constructs. The host cell can be a higher eukaryotic cell, such as a 
mammalian cell, or a lower eukaryotic cell, such as a yeast cell, or the host cell can be a 
prokaryotic cell, such as a bacterial cell. Introduction of the construct into the host cell 
10 can be effected by calcium phosphate transfection, DEAE-Dextran mediated 

transfection, Iipofection or electroporation (Davis, L., Dibner, M., Battey, I., Basic 
Methods in Molecular Biology, (1986)). 

Such constructs in host celis are preferably used in a conventional manner to 
produce the gene product encoded by the recombinant sequence. Alternatively, the 
15 polypeptides of the invention can be synthetically produced by conventional peptide 
synthesizers or by chemical ligation of suitable fragments thus prepared. 

Mature proteins can be expressed in mammalian cells, yeast, bacteria, or other 
cells under the control of appropriate promoters. Cell-free translation systems can also 
be employed to produce such proteins using RNAs derived from the DNA constructs of 
20 the present invention. Appropriate cloning and expression vectors for use with 

prokaryotic and eukaryotic hosts are described by Sambrook, et al., Molecular Cloning: 

A Laboratory Manual, Second Edition, Cold Spring Harbor, N.Y., (1989). 

Transcription of the DNA encoding the polypeptides of the present invention by 
higher eukaryotes is increased by inserting an enhancer sequence into the vector. 

25 Enhancers include cis-acting elements of DNA, usually about from 10 to 300 bp, that act 
on a promoter to increase its transcription. Examples include the SV40 enhancer on the 
late side of the replication origin bp 100 to 270, a cytomegalovirus early promoter 
enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus 
enhancers. 

30 Generally, recombinant expression vectors will include origins of replication and 

selectable markers permitting transformation of the host cell, e.g., the ampicillin- 
resistance gene of E. coli and S. cerevisiae TRP1 gene, and a promoter derived from a 
highly expressed gene to direct transcription of a downstream structural sequence. Such 
promoters can be derived from operons encoding glycolytic enzymes, such as 3- 
35 phosphoglycerate kinase (PGK), alpha-factor, acid phosphatase, or heat shock proteins, 
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among others. The heterologous structural sequence is assembled in appropriate phase 
with translation initiation and termination sequences, and preferably, a leader sequence 
capable of directing secretion of translated protein into the periplasmic space or 
extracellular medium. Optionally, the heterologous sequence can encode a fusion 
protein including an N-terminal identification peptide imparting desired characteristics, 
e.g., stabilization or simplified purification of expressed recombinant product. 

Useful expression vectors for bacterial use are constructed by inserting a 
structural DNA sequence encoding a desired protein together with suitable translation 
initiation and termination signals in operable reading phase with a functional promoter. 
The vector will comprise one or more phenotypic selectable markers and an origin of 
replication to ensure maintenance of the vector and to, if desired, provide amplification 
within the host. Suitable prokaryotic hosts for transformation include E. coli. Bacillus 
subtiUs, Salmonella typhimurium and various species within the genera Pseudomonas, 
Streptomyces, and Staphylococcus, although others may also be employed as a matter of 
choice. 


As a representative but non-limiting example, useful expression vectors for 
bacterial use can comprise a selectable marker and bacterial origin of replication derived 
from commercially available plasmids comprising genetic elements of the well known 
cloning vector pBR322 (ATCC 37017). Such commercial vectors include, for example, 
20 pKK223-3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM 1 (Promega Biotec, 

Madison, Wis., U.S.A.). These pBR322 "backbone" sections are combined with an 
appropriate promoter and the structural sequence to be expressed. 

Following transformation of a suitable host strain and growth of the host strain to 
an appropriate cell density, the selected promoter is induced by appropriate means (e.g., 
25 temperature shift or chemical induction), and cells are cultured for an additional period. 
Cells are typically harvested by centrifugation and then disrupted by physical or 
chemical means, with the resulting crude extract being retained for further purification. 
Microbial cells employed in expression of proteins can be disrupted by any convenient 
method, including freeze-thaw cycling, sonication, mechanical disruption and use of 
30 cell-lysing agents; such methods are well known to those skilled in the art. 

Various mammalian cell culture systems can also be employed to express a 
recombinant protein. Examples of mammalian expression systems include the COS-7 
lines of monkey kidney fibroblasts, described by Gluzman, Cell. 23 :175 (1981). Other 
cell lines capable of expressing a compatible vector include, for example, the Cl 27, 3T3, 
35 CHO, HeLa and BHK cell lines. Mammalian expression vectors will generally comprise 
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an origin of replication, a suitable promoter and enhancer, and also any necessary 
ribosome binding sites, polyadenylation site, splice donor and acceptor sites, 
transcriptional termination sequences, and 5' flanking nontranscribed sequences. DNA 
sequences derived from the SV40 splice, and polyadenylation sites may be used to 
5 provide required nontranscribed genetic elements. 

The polypeptides can be recovered and purified from recombinant cell cultures 
by methods including ammonium sulfate or ethanol precipitation, acid extraction, anion 
or cation exchange chromatography, phosphocellulose chromatography, hydrophobic 
interaction chromatography, affinity chromatography, hydroxylapatite chromatography 
10 and lectin chromatography. Recovery can be facilitated if the polypeptide is expressed 
at the surface of the cells, but such is not a prerequisite. Recovery may also be desirable 
of cleavage products that are cleaved following expression of a longer form of the 
polypeptide. Protein refolding steps as known in this art can be used, as necessary, to 
complete configuration of the mature protein. High performance liquid chromatography 
15 (HPLC) can be employed for final purification steps. 

The polypeptides of the present invention may be purified natural products, or 
produced by recombinant techniques from a prokaryotic or eukaryotic host (for example, 
by bacterial, yeast, higher plant, insect or mammalian cells in culture). Depending upon 
the host employed in a recombinant production procedure, the polypeptides of the 
20 present invention may be glycosylated or may be non-glycosylated. Polypeptides of the 
invention may also include an initial methionine amino acid residue. 

In a preferred embodiment, the proteins of the invention are isolated and purified 
so as to be substantially free of contamination from other proteins. For example, the 
proteins of the invention should constitute at least 80% by weight of the total protein 
25 present in a sample, more preferably at least 90%, even more preferably at least 95%, 
and most preferably at least 98% by weight of the total protein. 

These proteins may be in the form of a solution in water, another suitable solvent, 
such as dimethyl sulphoxide (DMSO) or ethanol, or a mixture of suitable solvents. 
Examples of mixtures of solvents include 10% (by weight) ethanol in water and 2% (by 
30 weight) DMSO in water. A solution may further comprise salts, buffering agents, 

chaotropic agents, detergents, preservatives and the like. Alternatively, the proteins may 
be in the form of a solid, such as a lyophilised powder or a crystalline solid, which may 
also comprise a residual solvent, a salt or the like. 

As used herein, the term "antibodies" includes polyclonal antibodies, 

35 affinity-purified polyclonal antibodies, monoclonal antibodies, and antigen-binding 
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fragments, such as F(ab’) 2 and Fab' proteolytic fragments. Genetically engineered intact 
antibodies or fragments, such as chimeric antibodies, Fv fragments, single chain 
antibodies and the like, as well as synthetic antigen-binding peptides and polypeptides, 
are also included. Non-human antibodies may be humanized by grafting non-human 
5 CDRs onto human framework and constant regions, or by incorporating the entire 
non-human variable domains (optionally "cloaking" them with a human-like surface by 
replacement of exposed residues, wherein the result is a "veneered" antibody). In some 
instances, humanized antibodies may retain non-human residues within the human 
variable region framework domains to enhance proper binding characteristics. Through 
10 humanizing antibodies, biological half-life may be increased, and the potential for 
adverse immune reactions upon administration to humans should be reduced. 

Alternative techniques for generating or selecting antibodies useful herein 
include in vitro exposure of lymphocytes to human piohornione DFRP protein or a 
peptide therefrom, and selection of antibody display libraries in phage or similar vectors 
15 (for instance, through use of immobilized or labeled human DPRP protein or peptide). 
Genes encoding polypeptides having potential human DPRP polypeptide binding 
domains can be obtained by screening random peptide libraries displayed on phage 
(phage display) or on bacteria, such as E. coli. Nucleotide sequences encoding such 
polypeptides can be obtained in a number of ways well known in this art. 

20 As would be evident to one of ordinary skill in the art, polyclonal antibodies can 

be generated from inoculating a variety of warm-blooded animals, such as horses, cows, 
goats, sheep, dogs, chickens, rabbits, mice and rats, with a human DPRP polypeptide or 
a fragment thereof. The immunogenicity of a human prohormone DPRP polypeptide 
may be increased through the use of an adjuvant, such as alum (aluminum hydroxide) or 
25 Freund's complete or incomplete adjuvant, or surface active substances, such as 
lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, KLH or 
dinitrophenol. Among adjuvants used in humans, BCG (bacilli Calmette-Guerin) and 
Corynebacterium parvum are especially preferable. Polypeptides useful for 
immunization also include fusion polypeptides, such as fusions of DPRP or a portion 
30 thereof with an immunoglobulin polypeptide or with maltose binding protein. The 
polypeptide immunogen may be a full-length molecule or a portion thereof. If the 
polypeptide portion is "hapten-like", such portion may be advantageously joined or 
linked to a macromolecular carrier, such as keyhole limpet hemocyanin (KLH), bovine 
serum albumin (BSA) or tetanus toxoid, for immunization. Antibodies to DPRP may 
35 also be generated using methods that are well known in the art. Such antibodies may 
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include, but are not limited to, polyclonal, monoclonal, chimeric, and single chain 
antibodies, Fab fragments, and fragments produced by a Fab expression library. 

Neutralizing antibodies (i.e., those which block or modify interactions at the active sites) 
are especially preferred for therapeutic use. 


5 



15 


For the production of antibodies, binding proteins, or peptides which bind 
specifically to DPR?, libraries of single chain antibodies, Fab fragments, other antibody 
fragments, non-antibody protein domains, or peptides may be screened. The libraries 


could be generated using phage display, other recombinant DNA methods, or peptide 
synthesis (Vaughan, T. J. et al. Nature Biotechnology 14 - 309-314 (1966)). Such 
libraries would commonly be screened using methods which are well known in the art to 
identify sequences which demonstrate specific binding to DPRP. 

It is preferred that the oligopeptides, peptides, or fragments used to induce 


antibodies to DPRP have an amino acid sequence consisting of at least about 5 amino 
acids and, more preferably, of at least about 10 amino acids. It is also preferable that 
these oligopeptides, peptides, or fragments are identical to a portion of the amino acid 


sequence of the natural protein. Short stretches of DPRP amino acids may also be fused 

with those of another protein, such as KLH, and antibodies to the chimeric molecule may 
be produced. 


Monoclonal antibodies to DPRP may be prepared using any well known 
20 technique which provides for the production of antibody molecules by continuous cell 
lines in culture. These include, but are not limited to, the hybridoma technique, the 
human 6-cell hybridoma technique, and the EBV-hybridoma technique, although 
monoclonal antibodies produced by hybridoma cells may be preferred. 

In addition, techniques developed for the production of "chimeric antibodies", 
25 such as the splicing of mouse antibody genes to human antibody genes to obtain a 
molecule with appropriate antigen specificity and biological activity, can be used, see 
Neuberger, M.S. et al. Nature 312: 604-608 (1984). Alternatively, techniques described 
for the production of single chain antibodies may be adapted, using methods known in 
the art, to produce DPRP-specific single chain antibodies. Antibodies with related 
30 specificity, but of distinct idiotypic composition, may be generated by chain shuffling 

from random combinatorial immunoglobulin libraries. (Burton D. R. Proc. Natl. Acad 
Sci. 88 : 11120-11123 (1991)). 

Antibodies may also be produced by inducing in vivo production in the 
lymphocyte population or by screening immunoglobulin libraries or panels of highly 
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specific binding reagents as disclosed in the literature. (Orlandi, R. et al. Proc. Natl. 
Acad. Sci. 86 : 3833-3837 (1989)). 

Antibody fragments which contain specific binding sites for DPRP may also be 
generated. For example, such fragments include, but are not limited to, F(ab') 2 
fragments produced by pepsin digestion of the antibody molecule and Fab fragments 
generated by reducing the disulfide bridges of the F(ab') 2 fragments. Alternatively, Fab 
expression libraries may be constructed to allow rapid and easy identification of 
monoclonal Fab fragments with the desired specificity. (Huse, W. D. et al. Science 254 : 


1275-1281 (1989)). 

Various immunoassays may be used to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays 
using either polyclonal or monoclonal antibodies with established specificities are well 
known in the art. Such immunoassays typically involve the measurement of complex 
formation between DPRP and its specific antibody. A two-site, monoclonal-based 
immunoassay utilizing monoclonal antibodies reactive to two non-interfering DPRP 
epitopes is preferred, but a competitive binding assay may also be employed. 

As earlier mentioned, the DPRPs can be used in treatment of the Diseases. 
Pharmaceutical compositions suitable for use in this aspect of the invention include 
compositions wherein the active ingredients are contained in an effective amount to 
achieve the intended purpose relating to one of the Diseases. The determination of a 
therapeutically effective dose is well within the capability of those skilled in the art and 
can be estimated initially either in cell culture assays, e.g. of neoplastic cells, or in 
animal models, usually mice, rats, rabbits, dogs, or pigs. An animal model may also be 
used to determine the appropriate concentration range and route of administration, which 
information is then commonly used to determine useful doses and routes for 
administration in humans. 


A therapeutically effective dose refers to that amount of active ingredient, e.g. a 
DPRP or fragment thereof, antibodies of DPRP, or an agonist, antagonist or inhibitor of 
DPRP, which ameliorates particular symptoms or conditions of the Disease. For 
example, the amount to be administered may be effective to cleave a desired target 
substrate upon contact therewith. Therapeutic efficacy and toxicity may likewise be 
determined by standard pharmaceutical procedures in cell cultures or with experimental 
animals, such as by calculating the ED50 (the dose therapeutically effective in 50% of 
the population) or LD50 (the dose lethal to 50% of the population) statistics. The dose 
ratio of toxic to therapeutic effects is the therapeutic index, and it can be expressed as the 
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LD50/ED50 ratio. Pharmaceutical compositions which exhibit large therapeutic indices 
are preferred. The data obtained from cell culture assays and animal studies is used in 
formulating a range of dosage for human use. The dosage contained in such 
compositions is preferably within a range of circulating concentrations that include the 
5 ED50 with little or no toxicity. The dosage varies within this range depending upon the 
dosage form employed, the sensitivity of the patient, and the route of administration. 

An exact dosage will normally be determined by the medical practitioner in light 
of factors related to the subject requiring treatment, with dosage and administration 
being adjusted to provide a sufficient level of the active moiety or to maintain a desired 
10 effect. Factors to be taken into account include the severity of the disease state, the 
general health of the subject, the age, weight, and gender of the subject, diet, time and 
frequency of administration, drug combination(s), reaction sensitivities, and 
tolerance/response to therapy. Long-acting pharmaceutical compositions may be 
administered every 3 to 4 days, every week, or even once every two weeks, depending 
15 on the half-life and clearance rate of the particular formulation. 

Yet another aspect of the invention provides polynucleotide molecules having 
sequences that are antisense to mRNA transcripts of DPRP1, DPRP2 and DPRP-3 
polynucleotides. Administration of an antisense polynucleotide molecule can block the 
production of the protein encoded by DPRP-1, DPRP2 or DPRP-3. The techniques for 
20 preparing antisense polynucleotide molecules and administering such molecules are 
known in the art. For example, antisense polynucleotide molecules can be encapsulated 
into liposomes for fusion with cells. 

In particular, the expression of DPRP-1, DPRP-2 and DPRP-3 in specialized 
epithelial cells, immune cells (lymphocytes and B cells), astrocytic tumors, and in 
25 various hormone sensitive cancers provides evidence of a potential role in the 

pathophysiology of cancer, metaplasia and metastasis. Therefore in a further aspect, the 
invention relates to diagnostic assays for detecting diseases associated with inappropriate 
DPRP activity or expression levels. Antibodies that specifically bind DPRP may be used 
for the diagnosis of disorders characterized by expression of DPRP, or in assays to 
30 monitor patients being treated with DPRP or with agonists or antagonists (inhibitors) of 
DPRP. Antibodies useful for diagnostic purposes may be prepared in the same manner 
as those described above for therapeutics. Diagnostic assays for DPRP include methods 
that utilize the antibody and a label to detect DPRP in human body fluids or in extracts 
of cells or tissues. The antibodies may be used with or without modification, and they 
35 may be labeled by covalent or non-covalent joining with a reporter molecule. A wide 
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variety of reporter molecules are known in the art. Recombinant DPRP proteins that 
have been modified so as to be catalytically inactive can also be used as dominant 
negative inhibitors. Such modifications include, for example, mutation of the active site. 

A variety of protocols for measuring DPRP, including ELISAs, RIAs and FACS, 
5 are known in the art and provide a basis for diagnosing altered or abnormal levels of 
DPRP expression. Normal or standard values for DPRP expression are established by 
combining body fluids or cell extracts taken from normal mammalian subjects, 
preferably human, with antibody to DPRP under conditions suitable for complex 
formation. The method for detecting DPRP in a biological sample would comprise the 
10 steps of: a) providing a biological sample; b) combining the biological sample and an 
anti-DPRP antibody under conditions which are suitable for complex formation to occur 
between DPRP and the antibody; and c) detecting complex formation between DPRP 
and the antibody, thereby establishing the presence of DPRP in the biological sample. 
The amount of complex formation then may be quantified by various methods, 

15 preferably by photometric means. Quantities of DPRP expressed in subject, control, and 
disease samples from biopsied tissues are compared with the standard values. Deviation 
between standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding DPRP are 
used for diagnostic purposes, which polynucleotides may include oligonucleotide 
20 sequences, complementary RNA and DNA molecules, and PNAs. These 

polynucleotides may be used to detect and quantitate gene expression in biopsied tissues 
in which expression of DPRP may be correlated with one of the Diseases. The 
diagnostic assay may be used to distinguish between absence, presence, and excess 
expression of DPRP and to monitor regulation of DPRP levels during therapeutic 
25 intervention. Moreover, pharmacogenomic, single nucleotide polymorphisms (SNP) 
analysis of the DPRP genes can be used as a method to screen for mutations that indicate 
predisposition to disease or modified response to drugs. 

DPRP polynucleotide and polypeptide sequences, fragments thereof, antibodies 
of DPRPs, and agonists, antagonists or inhibitors of DPRPs can be used to as discovery 
30 tools to identify molecular recognition events and therefore proteins, polypeptides and 
peptides that interact with DPRP proteins. A specific example is phage display peptide 
libraries where greater than 108 peptide sequences can be screened in a single round of 
panning. Such methods as well as others are known within the art and can be utilized to 
identify compounds that inhibit or enhance DPRP-1, DPRP-2 or DPRP-3 activity. 

35 Coupled links represent functional interactions such as complexes or pathways, and 
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proteins that interact with DPRPs can be identified by a yeast two-hybrid system, 

proteomics (differential 2D gel analysis and mass spectrometry) and genomics 

(differential gene expression by microarray or serial analysis of gene expression SAGE). 

Proteins identified as functionally linked to DPRPs and the process of interaction form 

5 the basis of methods of screening for inhibitors, agonists and antagonists and modulators 
of these DPRP-protein interactions. 




20 


The term "antagonist," as it is used herein, refers to an inhibitor molecule which, 
when bound to DPRP, decreases the amount or the duration of the effect of the 
biological or immunological activity of DPRP, e.g. decreasing the enzymatic activity of 
the peptidase to cleave the N-terminal dipeptide. Antagonists may include proteins, 
nucleic acids, carbohydrates, antibodies, or any other molecules which decrease the 
effect of DPRP; for example, they may include small molecules and organic compounds 


that bind to and inactivate DPRPs by a competitive or non-competitive type mechanism. 


Specific examples of DPRP tetrapeptide peptidic enzyme activity inhibitors are 
described in Example 6 and 7. Inhibitors can be, for example, inhibitors of the DPRP 
protease activity, or alternatively inhibitors of the binding activity of the DPRP to 
proteins with which they interact. Specific examples of such inhibitors can include, for 


example, anti-DPRP antibodies, peptides, protein fragments, or small peptidyl protease 
inhibitors, or small non-peptide, organic molecule inhibitors which are formulated in a 
medium that allows introduction into the desired cell type. Alternatively, such inhibitors 
can be attached to targeting ligands for introduction by cell-mediated endocytosis and 


other receptor mediated events. Such methods are described further below and can be 
practiced by those skilled in the art given the DPRP nucleotide and amino acid 


sequences described herein. 


25 A further use for DPRPs is for the screening of potential antagonists for use as 

therapeutic agents, for example, for inhibiting binding to DPRP, as well as for screening 
for agonists. DPRP, its immunogenic fragments, or oligopeptides thereof can be used 
for screening libraries of compounds which are prospective agonists or antagonists in 
any of a variety of drug screening techniques. The fragment employed in such screening 
30 may be free in solution, affixed to a solid support, borne on a cell surface, or located 
intracellularly. The formation of binding complexes between DPRP and the agent being 
tested is then measured. Other assays to discover antagonists that will inhibit DPRP are 
apparent from the disclosures ofU.S. Patents Nos. 6,011,155, 6,107,317, 6,110,949, 
6,124,305 and 6,166,063, which describe inhibitors of DPPIV. Another worthwhile use 
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of these DPRPs is the screening of inhibitors of DPPIV to show that they will not have 
undesired side effects by also inhibiting one or more of the DPRPs. 

A method provided for screening a library of small molecules to identify a 
molecule which binds DPRP generally comprises: a) providing a library of small 
5 molecules; b) combining the library of small molecules with the polypeptide of either 
SEQ ID NOS: 1, 3 or 5, or with a fragment thereof, under conditions which are suitable 
for complex formation; and c) detecting complex formation, wherein the presence of 
such a complex identifies a,small molecule which binds DPRP. 

One method for identifying an antagonist comprises delivering a small molecule 
10 which binds DPRP into extracts from cells transformed with a vector expressing DPRP 
along with a chromogenic substrate (e.g. Ala-Pro-AFC or Ala-Pro-AMC) under 
conditions where cleavage would normally occur, and then assaying for inhibition of 
cleavage by the enzyme by monitoring changes in fluorescence, or UV light absorption, 
by spectrophotometry to identify molecules that inhibit cleavage. A reduced rate of 
15 reaction or total amount of fluorescence or UV light absorption, in the presence of the 
molecule, establishes that the small molecule is an antagonist which reduces DPRP 
catalytic/enzymatic activity. Once such molecules are identified, they may be 
administered to reduce or inhibit cleaving by a DPRP. 

The term "agonist," as used herein, refers to a molecule which, when bound to 
20 DPRP, increases or prolongs the duration of the effect of DPRP. Agonists may include 
proteins, nucleic acids, carbohydrates, or any other molecules that bind to and modulate 
the effect of DPRP. Although it is less likely that small molecules will prove to be 
effective DPRP agonists, a method for identifying such a small molecule, which binds 
DPRP as an agonist, comprises delivering a chromogenic form of a small molecule that 
25 binds DPRP into cells transformed with a vector expressing DPRP and assaying for 
fluorescence or UV light absorption changes by spectrophotometry. An increased 
amount of UV absorption or fluorescence would establish that the small molecule is an 
agonist that increases DPRP activity. 

Another technique for drug screening which may be used provides for high 
30 throughput screening of compounds having suitable binding affinity to the protein of 
interest as described in published PCT application WO84/03564. In this method, large 
numbers of different small test compounds are synthesized on a solid substrate, such as 
plastic pins or some other surface. The test compounds are reacted with DPRP, or with 
fragments thereof, and then washed. Bound DPRP is then detected by methods well 
35 known in the art. Purified DPRP can also be coated directly onto plates for use in the 
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aforementioned drug screening techniques. Alternatively, non-neutralizing antibodies 
can be used to capture the peptide and immobilize it on a solid support. 

In another embodiment, one may use competitive drug screening assays in which 
neutralizing antibodies capable of binding DPRP specifically compete with a test 
5 compound for binding DPRP. In this manner, antibodies can be used to detect the 
presence of any peptide that shares one or more antigenic determinants with DPRP. 

As indicated above, by investigating the binding sites, ligands may be designed 
that, for example, have more interactions with DPRP than do its natural ligands. Such 
antagonist ligands will bind to DPRP with higher affinity and so function as competitive 
10 ligands. Alternatively, synthetic or recombinant proteins homologous or analogous to 
the ligand binding site of native DPRP may be designed, as may other molecules having 
high affinity for DPRP. Such molecules should also be capable of displacing DPRP and 
provide a protective effect. 

As indicated above, the knowledge of the structures of DPRP enables synthetic 
15 binding site homologues and analogues to be designed. Such molecules will facilitate 
greatly the use of the binding properties to target potential therapeutic agents, and they 
may also be used to screen potential therapeutic agents. Furthermore, they may be used 
as immunogens in the production of monoclonal antibodies, which antibodies may 
themselves be used in diagnosis and/or therapy as described hereinbefore. 

20 Given the ubiquitous expression of several members of the prolyl oligopeptidase 

S9B family, cell lines in which targeted gene disruption of DPPIV, DPRP-1, DPRP-2, 
DPRP-3, FAP and DPPVI to establish the null phenotype will be of great value to assist 
screening for selective and potent compounds. Accordingly, the invention provides such 
cell lines engineered with Lox-Neo ERES tk cassette and GFP-IRES-Neo Knock-in/out 
25 cassette DNA element for constructing somatic gene targeting vectors. 

Example 1 

Cion»ng_and Expression of DPRP genes Using the Mammalian Expression System 
DNA fragments encoding the full-length polypeptide DPRP-1 were amplified 
using PCR oligonucleotide primers corresponding to the 5' and 3' sequences of the gene, 

30 i.e. SEQ ID NO:45 and NO:46. In addition, DNA fragments encoding the full length 
polypeptide DPRP-2 were amplified using PCR oligonucleotide primers corresponding 
to the 5' and 3' sequences of that gene, i.e. SEQ ID NO:50 and NO:51. Furthermore, 

DNA fragments encoding the full length polypeptide DPRP-3 were amplified using PCR 
oligonucleotide primers corresponding to the 5' and 3' sequences of that gene, i.e. SEQ 
35 ID NO:55 and NO:56. 
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The three amplified sequences were respectively isolated from a 0.7% agarose 
gel using commercially available kit (GFX PCR DNA and Gel Band Purification Kit, 
Amersham Pharmacia Biotech Inc., Piscataway NJ, USA). The fragments were then 
ligated into cloning vector, pGEM-7Zf(-) (Promega Corporation, Madison WI, USA) 

5 and sequenced. The corresponding cloning constructs were respectively designated 
pGEM7-DPRPl, pGEM7-DPRP2 and pGEM7-DPRP3. The DNA sequences encoding 
the truncated DPRP-1 or DPRP-2 or DPRP-3 were amplified using pGEM7-DPRPl or 
pGEM7-DPRP2 or pGEM7-DPRP3 as a template and BCR oligonucleotide primers. 
SEQ ID NO:45 and NO:47 were used for DPRP-1; SEQ ID NO:50 and NO:52 were used 
10 for DPRP-2; and SEQ ID NO:57 and NO:58 for DPRP-3. The amplified sequences were 
again isolated from a 0.7% agarose gel using the same purification kits and sub-cloned 
into pGEM-7Zf(-). The resulting constructs were designated pGEM7-DPRPlf, pGEM7- 

nPRPOf anrl r>f;PXjf7_nDDI>ir 
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To make the DPRP-I mammalian expression construct, pGEM7-DPRPl was 
15 digested with the restriction enzymes Kpnl and Notl to release the full length DPRP-1 
gene. The DNA fragment carrying the DPRP-1 gene was gel band purified using the 
above kit and then inserted into expression vector pcDNA3 (Invitrogen, Carlsbad CA, 
USA) to make the native DPRP-1 expression construct, which was designated pcDNA- 
DPRP1. pGEM7-DPRPlf was digested with the restriction enzymes Xbal and Hindlll 
20 to release the truncated DPRP-1 f gene. The DNA fragment carrying the DPRP-1 f gene 
was gel band purified using the above kit and then inserted into expression vector 
pcDNA3.1(-)/myc-His A (Invitrogen, Carlsbad CA, USA) to make the tagged DPRP-1 
expression construct pcDNA-MycHis-DPRP 1. 

To make the DPRP-2 mammalian expression construct, pGEM7-DPRP2 was 
25 digested with the restriction enzymes Hindlll and BamHI to release the full length 
DPRP-2 gene. The DNA fragment carrying the DPRP-2 gene was gel band purified 
using the above kit and then inserted into expression vector pcDNA3 (Invitrogen, 
Carlsbad CA, USA) to make the native DPRP-2 expression construct, which was 
designated pcDNA-DPRP2. pGEM7-DPRP2f was digested with the restriction enzymes 
30 EcoRI and BamHI to release the truncated DPRP-2f gene. The DNA fragment carrying 
the DPRP-2f gene was gel band purified using the above kit and then inserted into 
expression vector pcDNA3.1(-)/myc-His B (Invitrogen, Carlsbad CA, USA) to make the 
tagged DPRP-2 expression construct designated pcDNA-MycHis-DPRP2. 

To make the DPRP-3 mammalian expression construct, pGEM7-DPRP3 was 
35 digested with the restriction enzymes EcoRI and Xhol to release the full length DPRP-3 
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gene. The DNA fragment carrying the DPRP-3 gene was gel band purified using the 
above kit and then inserted into expression vector pcDNA3 (Invitrogen, Carlsbad CA, 
USA) to make the native DPRP-3 expression construct designated pcDNA-DPRP3. 
pGEM7-DPRP3f was digested with the restriction enzymes Nhel and Apal to release the 
5 truncated DPRP-3f gene. The DNA fragment carrying the DPRP-3f gene was gel band 
purified using the above kit and then inserted into expression vector pcDNA3.1(-)/myc- 

His B (Invitrogen, Carlsbad CA, USA) to make the tagged DPRP-3 expression construct 
pcDNA-MycHis-DPRP3. 


Expression Pattern of DPRP genes in human tissue 


Quantitative PCR analysis was carried out to examine the levels of expression of 
the mRNAs for the polypeptides of the present invention in human tissues. RT PCR was 


O 1 OA 

W1UU VU1 1 1UU 


out 0.1 a uUuiuer of human ceii iines including but not limited to prostate 


cancer ceUs (LNCaP, PC3, DU 145), the MLTC-1 line (mouse testis), and MDA-MB231 

cells (breast cancer). Bands of the expected sizes for DPRP-1, DPRP-2 and DPPIV were 

all expressed in the various cancer cells lines, with FAP also being expressed at very low 
levels. 


Northern blot analysis was performed with 2pg poly(A) + RNA isolated from 
20 eight different tissues using DPRP probes. Specifically, a human Multiple Tissue 
Northern (MTN) blot (Clontech, Palo Alto, Calif.) was probed with a 1 kb N-terminal 
fragment that had been radioactively labeled by random priming in the presence of a 
32 PdCTP (A. P. Feinberg et al„ Anal. Biochem.. 132, 6 (1983)). Hybridization was 
performed at 68°C overnight in ExpressHyb™ hybridization solution (Clontech, Palo 
25 Alto, Calif.). The blots were first washed at room temperature in 2 times SSC and 

0.05% SDS, and then washed at 60°C (DPRP-1 & DPRP-2) and 50°C (DPRP-3) in 0.1 
times SSC and 0.1% SDS. 

Northern analysis showed expression of DPRP-1 in several tissues with the most 
abundant signal being in testis, prostate, muscle and brain. Testis showed 3 transcripts 
30 approximately 7.5,4.5 and 2.5 kb in length. The shorter mRNA species was very 
abundant in testis but negligible in the other tissues tested. DPRP-2 was ubiquitously 
expressed in every tissue with highest levels in liver and muscle and a predominant 
transcript at 5kb. DPRP-3 expression was limited to brain and pancreas. Further 
analysis was conducted for the three proteases in specific brain regions (cerebellum, 

35 cortex, medulla, spinal cord, occipital lobe, frontal lobe temporal lobe and putamen) 


- 31 - 


i 


WO 02/31134 PCT/IJS01/31874 

DPRP-1 was expressed in all regions with low levels present in the spinal cord, while 
DPRP-2 was expressed in all brain regions tested. 

Oligonucleotide primers SEQ ID NO:48 and NO:49 were used for DPRP-1 
quantitative PCR, whereas oligonucleotide primers SEQ ID NO:53 and NO:54 were 
5 used for DPRP-2 quantitative PCR. Human Multiple Tissue cDNA (MTC™) Panel I 
and Panel II (Clontech, Palo Alto CA, USA) were used as normalized cDNA templates. 
0.5 ng of each cDNA were used in a 25 pi PCR reaction, with each primer at a final 
concentration of 300 nM. The PCR reaction was performed using a SYBR Green PCR 
Core Reagents Kit (Applied Biosystems, Foster City CA, USA) and detected with an 
10 Applied Biosystems GeneAmp 5700 sequence detection system. Manufacturer’s 

recommended thermal cycling parameter, e.g. 50°C for 2 min, 95°C for 10 min followed 
by 40 cycles of 95°C for 15 sec and 60°C for 1 min was used. Data obtained shows 
relatively high rates of expression for both DPRP-1 and DPRP-2 in the pancreas, ovary 
and testis, and a particularly high rate for DPRP-2 in the liver. 

15 Example 3 - Production of DPRP Polyclonal Antibodies and Western Blotting 

The amino acid sequence deduced from the cDNA encoding DPRP-1 was 
analyzed using DNASTAR software (DNASTAR, Inc.) to determine regions of high 
immunogenicity, and a corresponding oligopeptide was synthesized and used to raise 
anti-DPRP-1 antibodies. The procedure was repeated for DPRP-2 and DPRP-3. The 
20 selection of appropriate peptide sequences and the techniques for antibody production 
are methods well known to those of skill in the art Selection of appropriate epitopes, 
such as those near the C-terminus or in hydrophilic regions, is well known in this art. 

Typically, oligopeptides that are about 15 to 20 residues in length, e.g. SEQ ID 
NO:59 for DPRP-1, SEQ ID NO:60 for DPRP-2 and SEQ ID NO:61 for DPRP-3, were 
25 synthesized using an Applied Biosystems Peptide Synthesizer Model 431 A. 

Fmoc-chemistry was used and the 19- or 15-residue peptides were respectively coupled 
to keyhole limpet hemocyanin (KLH, Sigma, St. Louis, Mo.) by reaction with 
N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS). Rabbits were immunized 
with the oligopeptide-KLH complex in complete Freund's adjuvant. The resulting 
30 antisera were tested for antipeptide activity, e.g., by binding the peptide to plastic, 
blocking with 1% BSA, reacting with rabbit antisera, washing, and reacting with 
radioiodinated, goat anti-rabbit IgG. 

Western blotting was performed using normal human protein samples (Protein 
Medley) obtained from Clontech (about 36 pg of total proteins). Proteins were 
35 fractionated through 10% SDS-polyacrylamide gels, and transferred to 0.45 mm 
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nitrocellulose membranes. Membranes were blocked in Tris-buffered saline (TBS) with ' 
0.05% Tween 20 and 1% BSA. Anti DPRP-1 or DPRP-2 specific antibodies were used 
as primary antibodies and were diluted 1:5,000 in Tris-buffered saline with 0.05% 

Tween 20 (TBST) and the Alkaline Phosphatase (AP) conjugated goat anti-Rabbit IgG 
5 (Promega) was diluted 1: 5,000 in the same buffer before use. The positive reaction was 
visualized by incubating the membrane in Western Blue Stabilized Substrate (Promega) 
for AP until the bands of interest have reached the desired intensity. DPRP-1 and 
DPRP-2 proteins were detected in brain, muscles, kidney, prostate, testis and ovary 
tissues. DPRP-1 and DPRP-2 were synthesized as approximately lOlkDa and lOOkDa 
10 forms, respectively, which are in good agreement with the molecular masses estimated 
from their primary structure as shown in Table 3. 

Table 3. Predicted Molecular Weight, Number of potential N-linked glycosylation sites 
(Asn residues) and predicted pi values of DPRP-1, DPRP-2 and DPRP-3, based on 
sequence analysis using the method developed by Hopp and Woods, Proc. Nat. Acad. 

15 Sci. 78:3824-3828 (1981). 



M.W. (Da.) 

No. of Asn 

pi 

DPRPl 

101422 

26 

5.39 

DPRP2 

98263 

27 

6.01 

DPRP3 

90914 

33 

6.11 


Several additional bands of similar molecular weight were observed. These are 
20 thought to be due to the presence of post-translational glycosylation of the proteins. 

Table 3 also shows the number of potential N-glycosylation sites for the DPRP proteins. 
The presence of glycosylated and unglcosylated forms of the proteins was evaluated 
using tunicamycin, an inhibitor of the oligosaccharide synthesis. It is evident that the 
smaller forms were unglycosylated forms. The correlation between mRNA (Northern 
25 analysis) and protein quantity (Western analysis) for DPRP-1 is shown in Table 4. 


Table 4. Correlation of mRNA and protein expression of DPRP-1 in human tissues 



Heart 

Brain 

Placenta 

Muscles 

Kidney 

Prostate 

Testis 

EJB« 

eehehei 

++ 

+++ 

+ 

+++ 

++ 

+-H- 

egg 

+ 



++++ 

- 

+ 

-H- 

L——_ 

+ 


+++ 


- 33 - 
























WO 02/31134 


PCT/USO1/31874 



Four-micron sections were prepared from a number of different formalin-fixed, 
paraffin-embedded human tissues. Tissue sections were deparaffmed through 4 
5 immersions in xylenes for 5 minutes, followed by a graded alcohol series to distilled 
water. Steam heat induced epitope recovery (SHIER) was used with several different 
SHIER solutions with and without enzyme digestion tissue in two different 
concentrations (Ladner et al, Cancer Res. : 60, p 3493-3503, 2000). The treatments and 
antibody dilutions employed are outlined below. 

10 1. Blocking Reagent for 15 minutes (Normal Goat Serum) 

2. Primary Antibody for 25,60 min or overnight incubation 

3. Secondary Antibody for 25 minutes (Biotinylated Goat-anti-rabbit IgG) 

4. Endogenous Peroxidase Blocking for 3 x 1.5 minutes 

5. ABC (avidin-biotin complex) / Horse Radish Peroxidase for 25 minutes 

15 6. DAB Chromogen for 3 x 5 minutes (Brown reaction product) 

7. Light Hematoxylin Counter Stain 1 minute 

Positive controls were run to assure the detection chemistries and antigen 
pretreatments were working appropriately. Rabbit IgG was run as a negative control. 

An avidin-biotin based tissue staining system was used for the detection of the DPRP-1 
20 antibody. Horseradish peroxide was used as a reporter enzyme with DAB as chromogen. 
After staining, slides were dehydrated through an alcohol series to absolute ethanol 
followed by xylene rinses. Slides were permanently coverslipped with glass coverslips 
and permount. Digital images of representative staining, where positive staining was 
indicated by a dark brown chromogen (DAB-HRP reaction product), were captured 
25 using a video camera from Olympus. Hematoxylin counterstain provides a blue nuclear 
stain to assess cell and tissue morphology. 

DPRP-1 rabbit polyclonal antibody labels formalin-fixed, paraffin-embedded 
human tissues, including normal testis, prostate glands, endometrial glands, tonsils and 
pancreas. It was also present in endothelial cells of normal ovary, bladder and kidney. 

30 Staining was localized in the cytoplasm in epithelial and some stromal cells such as 

fibroblasts, endothelial cells and lymphocytes. Interestingly in normal testis tested with 
DPRP-1 antibodies, there was distinctive expression in Leydig cells and multinucleated 
macrophages found in interstitial tissue, which is the space surrounding the seminiferous 
tubules. Tonsil B cells were stained with DPRP-1 antibody. 
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5 



15 


Example 5 



Purification 


Plasmid DNA of pcDNA-DPRPl, pcDNA-MycHis-DPRPl, pcDNA-DPRP-2 or 
pcDNA-MycHis-DPRP2 was transfected into PEAK (EdgeBioSystems, Gaithersburg 
MD, USA) or COS-1 (ATCC CRL-1650) using LipofectAmine (Life Technologies, 
Gaithersburg MD, USA) method recommended by the manufacturer. Transfected cells 
were maintained in DMEM with 5% FBS at 37°C with 5% CO, for 48 hours. Cells were 
then collected and used for recombinant protein extraction. Cells were harvested 48 
hours after transfection, homogenized and then spun at 18,000 x g for 40 min. The 
supemata were collected as cytosolic fractions. This fraction was loaded on TALON 
spin column (Clontech), and His-tagged proteins were eluted with 50mM PBS, 150mM 
imidazole, pH 7. Recombinant proteins were then detected by western blotting with 
anti-myc antibody and visualized usings. ProtoBIot IIAP system (Promega). 
Recombinant affinity purified fusions of the DPRP-1 and DPRP-2 were detected by 
western blot, and DPRP-1 and DPRP-2 were synthesized as 112kDa and 109kDa forms 


as predicted. 


Naturally occurring or recombinant DPRP proteins were substantially purified by 
immunoaffinity chromatography using antibodies specific for DPRP-1, DPRP-2 or 
DPRP-3. An immunoaffinity column was constructed by covalently coupling DPRP 
20 antibodies to an activated chromatographic resin, such as CNBr-activated Sepharose 

(Pharmacia & Upjohn). After the coupling, the resin was blocked and washed according 
to the manufacturer's instructions. 


Media or cell extracts containing DPRP proteins were passed over the 
immunoaffinity column, and the column was washed under conditions that allow the 
25 preferential absorbance of DPRPs (e.g., high ionic strength buffers in the presence of 
detergent). The column was eluted under conditions that disrupt antibody/DPRP binding 
(e.g., a buffer of pH 2-3 or a high concentration of a chaotrope, such as urea or 
thiocyanate ion), and purified DPRP was collected. 

Example 6 

30 Enzymatic Activity of DPRP proteins and Methods of Screening for Inhibitors 

The kinetic properties of recombinant DPRP-1 and DPRP-2 were determined in a 
continuous fluorimetric assay. Buffer, pH and temperature dependence optimization led 
to the following assay conditions: Enzyme assays were performed in 50mM PBS, pH7.4 
50 pi (50 pg/ml) of purified enzymes were mixed with 1 pi of different concentration of 
35 Ala-Pro-AMC (Enzyme Systems). Plates were then incubated at 37°C for 30 min, and 
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fluorescence was detected using a Wallac 1420 Fluorimeter with A.ex40355 and A.em535. 
The K^, values of DPRP-1 and DPRP-2 were similar (208 and 161 respectively). 

Further biochemical characterization reveals that DPRJP-1 and DPRP-2 have 
similar profiles to DPPIV. The two purified proteases and DPPIV were preincubated 
5 with inhibitors at room temperature for 30 min. Substrate, Ala-Pro-AMC (100 pM), was 
then added, and the fluorescence intensity was recorded as 60 readings during a 60 min 
period. The irreversible serine protease inhibitor AEBSF was the only inhibitor tested 
that showed strong inhibition of all three enzymes (Table 5). This confirms the structural 
and domain analysis prediction that these proteins belong to the serine protease 
10 superfamily. 

Table 5 . Inhibition of DPRP-1 and DPRP-2 by Protease Inhibitors 


Inhibitor 

Inhibitor Property- 

Concentration 

Residual activity 

(% of control) 

DPRP-1 

DPRP-2 

DPPIV 

AEBSF 

serine, irreversible 

5mM 

29.6 

23.9 

21.1 

Aprotinin 

serine, reversible 

5pg/ml 

77.5 

63.2 

80.2 

Pepstatin 

aspartic, reversible 

2pg/ml 

97.3 

95.0 

93.5 

DTT 

cysteine 

2mM 

100.1 

94.8 

98.3 

B-Mercaptoethonal 

cysteine 

lOOmM 

93.2 

84.0 

98.0 

EDTA 

metallo, reversible 

2mM 

91.5 

86.0 

93.5 

Leupeptin 

serine, reversible 

50|ig/ml 

91.1 

90.4 

90.7 


20 In addition to Ala-Pro-AMC, additional substrates tested also confirmed that 

DPRP-1 and DPRP-2 are dipeptidyl peptidases. The data were derived by determining 
the fluorescence change following a 30-minute incubation of the substrates (125 pM) 
with enzymes as a percentage of the fluorescence measured at Ala-Pro-AMC and Gly- 
Pro-AMC were the only good substrates among those tested. 

25 Table 6. DPRP-1 and DPRP-2 are dipeptidyl peptidases. 


Substrate 

% Change in Fluorescence at 30 minutes 

DPRP-1 

DPRP-2 

DPPIV 

Ala-Pro-AMC 

239.0 

127.5 

379.0 

Gly-Pro-AMC 

341.5 

205.0 

444.0 

Ala-Pro-pNA 

45.5 

44.0 

29.5 

Pro-pNA 

-1 

-2.5 

0.0 

Gly-Arg-pNA 

-4.5 

-0.5 

0.0 

Lys-Ala-pNA 

2.5 

0.5 

0.5 

Ala-Phe-Pro-pNA 

-4 

-0.5 

2.0 
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Additional natural and non-natural amino acid di-, tri- and tetra-peptides were tested in 
order to find an optimal substrate for testing each of the DPRP proteins that will also 
show reduced activity when incubated DPPIV. 

The enzyme assay method described here is one of a number of methods that can 
5 be utilized to screen for peptide and non-peptide inhibitors of the DPRP enzymes. 

Libraries of tetrapeptide inhibitors were tested to discover inhibitors of enzyme activity. 
Candidate inhibitors were prepared as 10-20 mM stock solutions in DMSO and stored at 
-20°C. Dilutions were made in assay buffer. Inhibition was determined by comparing 
the changes in fluorescence of the inhibited enzyme to the change in fluorescence of the 
10 control (vehicle) enzyme. 100-(fl units of sample/fl units of control x 100) gives 

percent inhibition value. The percent inhibition and the inhibitor concentration at which 
the enzyme was 50% inhibited (IC 50 ) was ascertained by plotting percent inhibition vs. 
inhibitor concentration on the log scale^ As shown in Figure 3, several tetrapeptides 
amides inhibited enzyme activity, wherein data are expressed as the % of activity in the 
15 presence of vehicle (0.02% DMSO) alone. Compounds were added at 1 mM. Most 
interesting was the apparent differential activity of some tetrapeptides for DPRP-1 and 
DPRP-2, compared to DPPIV. While all three enzymes were inhibited by Peptide-1, 
only DPRP-1 and DPRP-2 were significantly inhibited by Peptide-4 and Peptide-5. This 
demonstrates that selective inhibition of the purified enzymes is achievable. 

20 The assay described in this example can also be used to screen additional 

synthetic or naturally occurring compound libraries, including macromolecules, for 
agents that either inhibit or enhance DPRP activity. The DPRP-1 and DPRP-2 
polypeptides to be used in the assay can be obtained by, for example, in vitro translation, 
recombinant expression (see Example 5) or biochemical procedures. Methods other than 
25 those described here can also be used to screen and identify compounds that inhibit 
DPRP-1, DPRP-2 or DPRP-3, which methods can include, for example, binding assays 
such as ELISAs and RIAs. 

Example 7 

Effect of DPRP Inhibitors on the Proliferation of Human Cancer Cells In Vitro 
30 In an attempt to assess the effect that several inhibitors of DPRP-1 and DPRP-2 

activity may have on the proliferation of human cancer cells, LNCap, PC3 and Dul45, 
mouse testis line MLTC-1 and MDA-MB231 breast cancer cells were plated (10 4 per 
well) in 96-well tissue culture plates and allowed to grow and attach for 24 hours at 37°C 
in a C0 2 incubator. Compounds at various dilutions (final dilutions: 0.1 nM - 10 pM) 

35 were then added to the wells for various incubation periods from 24 hours to 96 hours, 
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with fresh compound being replaced each day. Addition of the diluent DMSO alone 
served as the control. Following incubation with these compounds in triplicate, 
proliferation of the cells was determined using an XTT cell proliferation assay (Roche 1 - 
465-015). The plates were read at 490 and 650nm 5 hours after the XTT mix was added. 

5 An increase in cell proliferation was observed with three of the inhibitors at 

concentrations equal to 0.1, 1, 10 and 100 x IC 50> and the results are shown in FIGS. 4A, 
4B and-4C for PC3 cells. 

Overall, the DPRPs are expressed in a wide variety of tissues as has been 
demonstrated by mRNA amplification, western blotting and immunohistochemistry. 

10 DPRP-1 was most abundant in the testis by Northern blot and western blot. The large 
number of expressed sequence tags (ESTs) from testis cDNA sources that are 
homologous to DPRP-1 also confirms abundant expression of DPRP-1 in testis. 

Example 4 describes the hnniunohisiocheniica] localization of DPRP-i protein in human 
testis using a specific DPRP-1 antibody. DPRP-1 is strongly expressed in epitheloid 
15 Leydig cells, and Leydig cells are the primary source of testicular androgens (male 

steroid hormones) in the mammalian male. In the interstitium of the testis, Leydig cells 
and macrophages are in close association with “digitation” of Leydig cell process 
extending onto macrophage surface. Multinucleated cells in close proximity to the 
Leydig cells were also stained with DPRP-1 antibody suggesting that the protease was 
20 also expressed in macrophages, and macrophages in the testis play an important role in 
the paracrine regulation of Leydig cells. Cytokines secreted by the testicular 
macrophages are mitogenic to Leydig cells and play an important role in the 
differentiation of mesenchymal progenitor cell into mature Leydig cells. A clearer 
understanding of the proteins and pathways involved in the maturation of the testis is 
25 important for the discovery of new treatments for precocious puberty. In addition, 

Leydig cells cause tumors such as sex cord-stromal tumors via sexual steroid production 
(predominantly testosterone). Testosterone is associated with several neoplasia and 
diseases such as breast carcinoma and uterine cancers, ovarian carcinoma and 
androgenic alopecia (hair loss). Further examination of the localization of DPRP 
30 proteins in other glands in the body (e.g. adrenal glands) that produce testosterone and 
other androgenic hormones are currently under investigation. The possible association 
of DPRP-1 with steroid and polypeptide hormone biosynthetic pathways functions is 
being investigated, and Example 7 is relevant to understanding the role of DPRP proteins 
in prostate, testis and breast in vitro cell models. 
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Immunohistochemical analysis also localized DPRP-1 to endometrial glands in 
the uterus (see Example 4), pancreatic acini, glomeruli of the kidney, plasma cells in the 
bladder, a subset of B-cells in the tonsils, columnar epithelial cells of the prostate and 
poorly differentiated prostate squamous metaplasia, Gleason grade 4 prostatic 
5 carcinoma, and hyperplastic glands in benign prostatic hyperplasia. Positive staining in 
breast carcinoma, as well as in seminoma and prostate squamous metaplasia, suggests a 
general association of DPRP-1 with hormone-sensitive tissues, particularly in cells that 
become poorly differentiated. The presence of the DPRP-1 in specialized epithelial cells 
and in inflammatory plasma cells (lymphocytes) is also of interest. Inflammatory breast 
10 carcinoma has an abundance of infiltrating lymphocytes and an overall bad prognosis. 
DPRP-1 and other DPRP proteins appear in medullary carcinomas that typically have a 
constant infiltrating lymphoplasmacytic component at the periphery of the tumor, which 
is thought to represent a reaction of the host tissues to the neoplasm. Most of the 
lymphocytes are T Cells, and most of the plasma cells are of the IgG-producing type. 

15 Several antigens are abundant on B cells, a subgroup of breast-cancer cells, and other 
epithelial cancer cells, and these antigens are targets for a new class of therapeutic 
monoclonal antibodies with some notable success having been achieved with a 
humanized monoclonal antibody against the B-cell-specific antigen CD20. 

Accordingly, monoclonal antibodies to DPRP proteins are felt to be useful to diagnose 
20 and treat diseases in which they are involved, including cancer. 

The expression of DPRP-1 in specialized epithelial cells of a number of tissues 
suggests that DPRP-1 and other DPRP proteins may be involved in growth and 
differentiation thereof. Testing using inhibitors described in Example 6 in in vitro 
models of prostate and testis cancer (Example 7) showed that DPRP-l/DPRP-2 
25 inhibitors caused a 50-60% increase in proliferation of PC3 cells at nM concentrations as 
shown in FIGS. 4A-4C. 

Although the invention has been described in accordance with its preferred 
embodiments, which constitute the best mode presently known to the inventors, it should 
be understood that changes and modifications as would be obvious to those skilled in 
30 this art may be made without departing from its scope which is set forth in the claims 
appended hereto. For example, although the disclosure focuses on DPRP-1 and DPRP-2 
in certain instances, DPRP-3 and its fragments are considered to be similarly useful, as 

are nucleic acids encoding same. Particular features of the invention are emphasized in 
the claims that follow. 
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CLAIMS: 

1. Isolated nucleic acid 

which encodes (a) a polypeptide, which includes the amino acid sequence of 
one of SEQ ID NOS: 1, 3 and 5, or (b) a polypeptide having an amino acid sequence that is 
at least about 70% similar thereto and exhibits the same biological function; 

or which is an alternative splice variant of one of SEQ ID NOS:2, 4 and 6; or 
which is a probe comprising at least 14 contiguous nucleotides from said 
nucleic acid encoding (a) or (b); or 

which is complementary to any one of the foregoing. 

2. The isolated nucleic acid of claim 1 which is DNA or RNA. 

3. The isolated nucleic acid of claim 1 which is a DNA transcript that includes 
the entire length of any one of SEQ ED NOS:2,4 and 6 or which is complementary to the 
entire coding region of one of SEQ ID NOS:2, 4 and 6. 

4. An antisense oligonucleotide directed against the DNA of claim 3. 

5. The isolated nucleic acid of claim 1 which is an RNA transcript which 
includes the entire length of any one of SEQ ID NOS:2, 4 and 6. 

6. The isolated nucleic acid of claim 1 which is an alternative splice variant of 
one of SEQ ID NOS:2,4 and 6. 

7. A polypeptide encoded by the nucleic acid of claim 6. 

8. The isolated nucleic acid of claim 1 which encodes a polypeptide having an 
amino acid that is at least about 90% similar to one of SEQ ID NOS: 1, 3 and 5. 

9. The isolated nucleic acid of claim 1 which encodes a polypeptide having an 
amino acid that is at least about 95% similar to one of SEQ ID NOS: 1, 3 and 5. 

10. The isolated nucleic acid of claim 1 which encodes a polypeptide that has at 
least about 90% identity with one of SEQ ED NOS:l, 3 and 5. 

11. A nucleic acid probe according to claim 1 comprising at least 14 contiguous 
nucleotides from one of SEQ ID NOS:2,4 and 6. 

12. An isolated recombinant polynucleotide molecule comprising nucleic acid 
according to claim 1 plus expression-controlling elements linked operably with said nucleic 
acid to drive expression thereof. 

13. An expression vector comprising the nucleic acid of claim 1 encoding a 
polypeptide having the entire amino acid sequence set forth in any one of SEQ ID NOS:l, 3 
and 5 operably linked to a promoter, said expression vector being present in a 
compatible host cell. 
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14. A mammalian, insect or bacterial host cell that has been genetically 
engineered by the insertion of nucleic acid according to claim 1 which codes for at least the 
mature protein portion of the amino acid sequence of SEQ ID NO: 1 3 or 5 

15. A process for producing a polypeptide which includes the mature protein 
portion of one of SEQ ID NOS.l, 3 and 5, which process comprises culturing the host ceil 
of claim 11 under conditions sufficient for the production of said polypeptide. 

16. The process of claim 15 wherein said polypeptide is expressed at the 

surface of said cell and further includes the step of recovering the polypeptide or a fragment 
thereof from the culture. 

17. A polypeptide 

which may be optionally glycosylated, and 

which (a) has the amino acid sequence of a mature nrntein form -- 

* —- - - w«. wu tU UUJf UliC 

of SEQ ID NOS: 1, 3 and 5; (b) has the amino acid sequence of a mature protein having at 
least about 70% similarity to one of the mature proteins of (a) and which exhibits the same 
biological function; (c) has the amino acid sequence of a mature protein having at le as t 
about 90% identity with a mature protein of any of SEQ ID NOS.l, 3 and 5; or (d) is an 
immunologically reactive fragment of (a). 

18. The polypeptide according to claim 14 which is a mature protein having at 
least about 95% similarity to a mature protein of (a). 

19. The polypeptide according to claim 14 which is a mature protein having at 
least about 95% similarity to a mature protein of (a). 

20. The polypeptide according to claim 14 having the amino acid sequence of the 
mature protein of one of SEQ ID NOS: 1, 3 and 5, or is a fragment thereof which exhibits 
the same biological function as the respective mature protein. 

21. A DPRP antagonist which inhibits the biological function of one of said 
mature proteins of claim 17, 18 and 19. 

22. An antibody that recognizes a polypeptide or a fragment according to 
claim 17. 

23. The antibody of claim 22 which recognizes a polypeptide having an amino 
acid sequence of SEQ ID NO: 1 or 3 or 5. 

24. A method for the screening for a compound capable of inhibiting the 
enzymatic activity of at least one mature protein of claim 17, which method comprises 
incubating said mature protein and a suitable substrate for said mature protein in the 
presence of one or more test compounds or salts thereof, measuring the enzymatic activity 
of said mature protein, comparing said activity with comparable activity determined in the 
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absence of a test compound, and selecting the test compound or compounds that reduce the 
enzymatic activity. 

25. A method for the screening for a compound capable of inhibiting the 
enzymatic activity of DPPIV that does not inhibit the enzymatic activity of at least one of 
the mature proteins of claim 20, which method comprises incubating said mature protein 
and a suitable substrate for said mature protein in the presence of one or more inhibitors of 
DPPIV or salts thereof, measuring the enzymatic activity of said mature protein, comparing 
said activity with comparable activity determined in the absence of the DPPIV inhibitor, 
and selecting a compound that does not reduce the enzymatic activity of said mature 
protein. 
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hDPRPl 600 
mDPRPl 601 


FWATILD3AGPLPDYTPPEIFSFESTTGFTLYGMLYKPI1DLQPGKKYPTVLFIYGGPQVC 
FWATILDGAGPLPDYTPPETFSFESTTGFTLYG MLY'KPUDLQPGKKYPTVLFIYGGPOVC 


hDPRPl 660 
mDPRPl 661 


LVEMRFKGVKYFRI.NTLASLGYVVVVIDNRGSCHRGLKFEGAFKYKMGQIEIDDQVEGLC 

r.VNNRFKGVKYFRr.NTLASLGYVVVVIDMRGSCHRGLKFEGAFKYKMGOIEIDDOVEGI.C 


hDPRPl 720 
mDPRPl 721 


Y'l.ASSYDFr DLDRVG1HGWS YGG j'LSI.KAI.MQRSDlFRVAIAGAPVTLWIFYDTCYTERY 
YLASgYDFTDLDRVGIHGWSYCGY'I.SLMALMQRSDT FRVAIAGAPVTLWIFYDTGYTERY 


hDPRPl 780 
mDPRPl 781 


MGHPDQNEQGYYI.GSVAMQAEKFPSEPNRLLLLHGFLDENVHFAHTSILLSFLVRAGKPY 

mgiipdqnf.qgyylgsvamgaekfpsepnrllllhgfldenvhfahtsillsflvragkpy 


hDPRPl 840 
mDPRPl 841 


DLQIYPQERHSIRVPESGF.HYELHLLHYLQENLGSRIAALKVl 

DLQIYPQERHSIRVPF.SGEHYELHLLHYI.QENLGSRIAAI.KVI 


FIG. 2 
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Sequence Listing Summary 


SEQ ID. 

1. DPRP1 a.a. sequence 

2. DPRP1 DNA sequence 

5 3. DPRP2 a. a. sequence 

4. DPRP2 DNA sequence 

5. DPRP-3 a.a. sequence 

6. DPRP-3 DNA sequence 

7. DPRP-1 transcript 0 a.a. sequence 

10 8. DPRP-1 transcript 0 DNA sequence 

9. DPRP-1 transcript 1 a.a. sequence 

10. DPRP-1 transcript 1 DNA sequence 

11. DPRP-1 transcript 2 a.a. sequence 

12. DPRP-1 transcript 2 DNA sequence 

15 13. DPRP-1 transcript 3 a.a. sequence 

14. DPRP-1 transcript 3 DNA sequence 

15. DPRP-1 transcript 4 a.a. sequence 

16. DPRP-1 transcript 4 DNA sequence 

17. DPRP-1 transcript 5 a.a. sequence 

20 18. DPRP-1 transcript 5 DNA sequence 

19. DPRP-1 transcript 6 a.a. sequence 

20. DPRP-1 transcript 6 DNA sequence 

21. DPRP-1 transcript 7 a.a. sequence 

22. DPRP-1 transcript 7 DNA sequence 

25 23. DPRP-2 transcript 0 a.a. sequence 

24. DPRP-2 transcript 0 DNA sequence 

25. DPRP-2 transcript 1 a.a. sequence 

26. DPRP-2 transcript 1 DNA sequence 

27. DPRP-2 transcript 2 a.a. sequence 

30 28. DPRP-2 transcript 2 DNA sequence 

29. DPRP-2 transcript 3 a.a. sequence 

30. DPRP-2 transcript 3 DNA sequence 

31. DPRP-2 transcript 4 a.a. sequence 

32. DPRP-2 transcript 4 DNA sequence 

35 33. DPRP-2 transcript 5 a.a. sequence 
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34. DPRP-2 transcript 5 DNA sequence 

35. DPRP-2 transcript 6 a.a. sequence 

36. DPRP-2 transcript 6 DNA sequence 

37. DPRP-2 transcript 7 a.a. sequence 

5 38. DPRP-2 transcript 7 DNA sequence 

39. DPRP-2 transcript 8 a.a. sequence 

40. DPRP-2 transcript 8 DNA sequence 

41. DPRP-3 transcript 0 a.a. sequence 

42. DPRP-3 transcript 0 DNA. Sequence 

10 43. DPRP-3 transcript 1 a.a. sequence 

44. DPRP-3 transcript 1 DNA sequence 

45. DPRP1 forward primer used for cloning 

46. DPRP 1 reverse primer used for cloning full length gene 

47. DPRP1 reverse primer used for cloning fusion gene 

15 48. DPRP 1 forward primer used for expression profiling 

49. DPRP1 reverse primer used for expression profiling 

50. DPRP2 forward primer used for cloning 

51. DPRP2 reverse primer used for cloning full length gene 

52. DPRP2 reverse primer used for cloning fusion gene 

20 53. DPRP2 forward primer used for expression profiling 

54. DPRP2 reverse primer used for expression profiling 

55. DPRP3 forward primer used for cloning 

56. DPRP3 reverse primer used for cloning full length gene 

57. DPRP3 forward primer used for cloning fusion gene 

25 58. DPRP3 reverse primer used for cloning fusion gene 

59. DPRP1 peptide antigen sequences 

60. DPRP2 peptide antigen sequences 

61. DPRP3 peptide antigen sequences 
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SEQUENCE LISTING 

<110> Qi, Steve 

Akinsanya, Karen 
Riviere, Pierre 
Junien, Jean-Louis 

<120> NOVEL SERINE PROTEASE GENES RELATED TO DPPIV 

<130> 70669 

<150> US 60/240,117 

<151> 2000-10-12 

<160> 61 

<170> Patent In version 3.1 


< 210 > 1 
<211> 862 
<212> PRT 
<213> Homo sapiens 
<400> 1 


Met 

1 

Ala 

Ala 

Ala 

Met 

5 

Glu 

Thr 

Glu 

Gin 

Leu 

10 

Gly Val 

Glu 

lie 

Phe 

15 

Glu 

Thr 

Ala 

Asp 

Cys 

20 

Glu 

Glu 

Asn 

lie 

Glu 

25 

Ser 

Gin Asp 

Arg 

Pro 

30 

Lys 

Leu 

Glu 

Pro 

Phe 

35 

Tyr 

Val 

GlU 

Arg 

Tyr 

40 

Ser 

Trp 

Ser Gin 

Leu 

45 

Lys 

Lys 

Leu 

Leu 

Ala 

50 

Asp 

Thr 

Arg 

Lys 

Tyr 

55 

His 

Gly 

Tyr 

Met Met 
60 

Ala 

Lys 

Ala 

Pro 

His 

65 

Asp 

Phe 

Met 

Phe 

Val 

70 

Lys 

Arg 

Asn 

Asp 

Pro Asp 
75 

Gly 

Pro 

His 

Ser 

80 

Asp 

Arg 

lie 

Tyr 

Tyr 

85 

Leu 

Ala 

Met 

Ser 

Gly 

90 

Glu Asn 

Arg 

Glu 

Asn 

95 

Thr 

Leu 

Phe 

Tyr 

Ser 

100 

Glu 

lie 

Pro 

Lys 

Thr 

105 

lie 

Asn Arg 

Ala 

Ala 

110 

Val 

lieu 

Met 

Leu 

Ser 

115 

Trp 

Lys 

Pro 

Leu 

Leu 

120 

Asp 

Leu 

Phe Gin 

Ala 

125 

Thr 

Leu 

Asp 

Tyr 

Gly Met 
130 

Tyr 

Ser 

Arg 

Glu 

135 

Glu 

Glu 

Leu 

Leu Arg 
140 

Glu Arg 

Lys 

Arg 

He 

145 

Gly Thr 

Val 

Gly 

lie 

150 

Ala 

Ser 

Tyr 

Asp 

Tyr His 
155 

Gin Gly 

Ser 

Gly 

160 

Thr 

Phe 

Leu 

Phe 

Gin 

165 

Ala 

Gly 

Ser 

Gly 

lie 

170 

Tyr His 

val 

Lys 

Asp 

175 

Gly 

Gly 

Pro 

Gin 

Gly 

180 

Phe 

Thr 

Gin 

Gin 

Pro 

185 

Leu 

Arg Pro 

Asn 

Leu 

190 

Val 

Glu 

Thr 

Ser 

Cys 

195 

Pro 

Asn 

lie 

Arg 

Met 

200 

Asp 

Pro 

Lys Leu 

Cys 

205 

Pro 

Ala 

Asp 

Pro 

Asp 

210 

Trp 

lie 

Ala 

Phe 

lie 

215 

His 

Ser 

Asn 

Asp lie 
220 

Trp 

lie 

Ser 

Asn 

lie 

225 

Val 

Thr 

Arg 

Glu 

Glu 

230 

Arg 

Arg 

Leu 

Thr 

Tyr Val 
235 

His 

Asn 

Glu 

Leu 

240 

Ala 

Asn 

Met 

Glu 

Glu 

245 

Asp 

Ala 

Arg 

Ser 

Ala 

250 

Gly Val 

Ala 

Thr 

Phe 

255 

Val 
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Leu Gin Glu Glu Phe Asp Arg Tyr Ser Gly Tyr Trp Trp Cys Pro Lys 

260 265 270 

Ala Glu Thr Thr Pro Ser Gly Gly Lys lie Leu Arg lie Leu Tyr Glu 
275 280 285 


Glu Asn Asp Glu Ser Glu Val Glu 
290 295 

Leu Glu Thr Arg Arg Ala Asp Ser 
305 310 

Ala Asn Pro Lys Val Thr Phe Lys 

325 

Glu Gly Arg lie lie Asp Val He 

340 

Glu lie Leu Phe Glu Gly Val Glu 
355 360 

Pro Glu Gly Lys Tyr Ala Trp Ser 
370 375 

Arg Leu Gin lie Val Leu lie Ser 
385 390 

Asp Asp Val Met Glu Arg Gin Arg 

405 

"val Thr pro Leu lie lie Tyr Glu 

420 

lie His Asp lie Phe His Val Phe 
435 440 

Glu Phe lie Phe Ala Ser Glu Cys 
450 455 

Lys lie Thr Ser lie Leu Lys Glu 
465 470 

Gly Leu Pro Ala Pro Ser Asp Phe 

485 

Ala lie Thr Ser Gly Glu Trp Glu 

500 

lie Gin Val Asp Glu Val Arg Arg 
515 520 

Asp Ser Pro Leu Glu His His Leu 
530 535 

Gly Glu Val Thr Arg Leu Thr Asp 
545 550 

lie Ser Gin His Cys Asp Phe Phe 

565 

Asn Pro His Cys Val Ser Leu Tyr 

580 

Pro Thr Cys Lys Thr Lys Glu Phe 
595 600 

Gly Pro Leu Pro Asp Tyr Thr Pro 
610 615 

Thr Thr Gly Phe Thr Leu Tyr Gly 
625 630 

Gin Pro Gly Lys Lys Tyr Pro Thr 

645 

Gin Val Gin Leu Val Asn Asn Arg 

660 

Leu Asn Thr Leu Ala Ser Leu Gly 
675 680 

Arg Gly Ser Cys His Arg Gly Leu 
690 695 

Lys Met Gly Gin lie Glu lie Asp 
705 710 


lie lie His Val Thr Ser Pro Met 

300 

Phe Arg Tyr Pro Lys Thr Gly Thr 
315 320 

Met Ser Glu lie Met lie Asp Ala 
330 335 

Asp Lys Glu Leu lie Gin Pro Phe 
345 350 

Tyr lie Ala Arg Ala Gly Trp Thr 

365 

lie Leu Leu Asp Arg Ser Gin Thr 

380 

Pro Glu Leu Phe lie Pro Val Glu 
395 400 

Leu lie Glu Ser Val Pro Asp Ser 
410 415 

Glu Thr Thr Asp lie Trp lie Asn 
425 430 

Pro Gin Ser His Glu Glu Glu lie 

445 

Lys Thr Gly Phe Arg His Leu Tyr 

460 

Ser Lys Tyr Lys Arg Ser Ser Gly 
475 480 

Lys Cys Pro lie Lys Glu Glu lie 
490 495 

Val Leu Gly Arg His Gly Ser Asn 
505 510 

Leu Val Tyr Phe Glu Gly Thr Lys 

525 

Tyr Val Val Ser Tyr Val Asn Pro 

540 

Arg Gly Tyr Ser His Ser Cys Cys 
555 560 

He Ser Lys Tyr Ser Asn Gin Lys 
570 575 

Lys Leu Ser Ser Pro Glu Asp Asp 
585 590 

Trp Ala Thr lie Leu Asp Ser Ala 

605 

Pro Glu lie Phe Ser Phe Glu Ser 

620 

Met Leu Tyr Lys Pro His Asp Leu 
635 640 

Val Leu Phe lie Tyr Gly Gly Pro 
650 655 

Phe Lys Gly Val Lys Tyr Phe Arg 
665 670 

Tyr Val Val Val Val lie Asp Asn 

685 

Lys Phe Glu Gly Ala Phe Lys Tyr 

700 

Asp Gin Val Glu Gly Leu Gin Tyr 
715 720 
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Leu Ala Ser Arg Tyr Asp Pbe lie Asp Leu Asp Arg Val Gly lie His 

725 730 735 

Gly Trp Ser Tyr Gly Gly Tyr Leu Ser Leu Met Ala Leu Met Gin Arg 

740 745 750 

Ser Asp lie Pbe Arg Val Ala lie Ala Gly Ala Pro Val Thr Leu Trp 
755 760 765 

lie Phe Tyr Asp Thr Gly Tyr Thr Glu Arg Tyr Met Gly His Pro Asp 
770 775 780 

Gin Asn Glu Gin Gly Tyr Tyr Leu Gly Ser Val Ala Met Gin Ala Glu 

785 790 795 800 

Lys Phe Pro Ser Glu Pro Asn Arg Leu Leu Leu Leu His Gly Phe Leu 

805 810 815 

Asp Glu Asn Val His Phe Ala His Thr Ser lie Leu Leu Ser Phe Leu 

820 825 830 

Val Arg Ala Gly Lys Pro Tyr Asp Leu Gin lie Tyr Pro Gin Glu Arg 
835 840 845 

His Ser lie Arg Val Pro Glu Ser Gly Glu His Tyr Glu Leu His Leu 
850 855 860 

Leu His Tyr Leu Gin Glu Asn Leu Gly Ser Arg lie Ala Ala Leu Lys 

865 870 875 880 

tt. ^ * 

VCt Ja X1C 

< 210 > 2 

<211> 2671 

<212> DNA 

<213> Homo sapiens 

<400> 2 

cggtaccatg gcagcagcaa tggaaacaga acagctgggt gttgagatat ttgaaactgc 60 

ggactgtgag gagaatattg aatcacagga tcggcctaaa ttggagcctt tttatgttga 120 

gcggtattcc tggagteage ttaaaaagct gettgeegat accagaaaat atcatggcta 180 

catgatgget aaggcaccac atgatttcat gtttgtgaag aggaatgatc cagatggacc 240 

tcattcagac agaatetatt accttgccat gtctggtgag aacagagaaa atacactgtt 300 

ttattctgaa attcccaaaa ctatcaatag ageageagte ttaatgetet ettggaagee 360 

tcttttggat ctttttcagg caacactgga ctatggaatg tattetegag aagaagaact 420 

attaagagaa agaaaacgca ttggaacagt eggaattget tettaegatt atcaccaagg 480 

aagtggaaca tttctgtttc aagccggtag tggaatttat cacgtaaaag atggagggcc 540 

acaaggattt acgcaacaac etttaaggee caatetagtg gaaactagtt gtcccaacat 600 

aeggatggat ccaaaattat gccctgctga tccagactgg attgctttta tacatagcaa 660 

cgatatttgg atatctaaca tcgtaaccag agaagaaagg agactcactt atgtgcacaa 720 

tgagetagee aacatggaag aagatgccag atcagctgga gtcgctacct ttgttctcca 780 

agaagaattt gatagatatt ctggctattg gtggtgtcca aaagctgaaa caactcccag 840 

tggtggtaaa attettagaa ttctatatga agaaaatgat gaatctgagg tggaaattat 900 

tcatgttaca tcccctatgt tggaaacaag gagggeagat tcattccgtt atcctaaaac 960 

aggtacagca aatcctaaag tcacttttaa gatgtcagaa ataatgattg atgetgaagg 1020 

aaggatcata gatgteatag ataaggaact aattcaacct tttgagattc tatttgaagg 1080 

agttgaatat attgccagag ctggatggac tcctgaggga aaatatgett ggtccatcct 1140 

actagatcgc tcccagactc gcctgcagat agtgttgatc tcacctgaat tatttatccc 1200 

agtagaagat gatgttatgg aaaggeagag actcattgag tcagtgcctg attctgtgac 1260 

gccactaatt atetatgaag aaacaacaga catctggata aatatccatg acatctttca 1320 

tgtttttccc caaagtcacg aagaggaaat tgagtttatt tttgcctctg aatgcaaaac 1380 

aggtttccgt catttataca aaattacatc tattttaaag gaaagcaaat ataaaegate 1440 

cagtggtggg ctgcctgctc caagtgattt caagtgtcct atcaaagagg agatagcaat 1500 

taccagtggt gaatgggaag ttcttggccg gcatggatct aatatccaag ttgatgaagt 1560 

cagaaggctg gtatattttg aaggcaccaa agactcccct ttagageate acctgtacgt 1620 

agtcagttac gtaaatcctg gagaggtgac aaggctgact gaccgtggct actcacattc 1680 

ttgctgcatc agtcagcact gtgaettett tataagtaag tatagtaacc agaagaatcc 1740 

acactgtgtg tccctttaca agctatcaag tcctgaagat gacccaactt gcaaaacaaa 1800 

ggaattttgg gccaccattt tggattcagc aggtcctctt cctgactata ctcctccaga i860 

aattttctct tttgaaagta ctactggatt tacattgtat gggatgetet acaagcctca 1920 
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tgatctacag cctggaaaga aatatcctac 
gcagttggtg aataatcgat ttaaaggagt 
tctaggttat gtggttgtag tgatagacaa 
tgaaggcgcc tttaaatata aaatgggtca 
ccaatatcta gcttctcgat atgatttcat 
gtcctatgga ggatacctct ccctgatggc 
tgctattgct ggggccccag tcactctgtg 
ttatatgggt caccctgacc agaatgaaca 
agcagaaaag ttcccctctg aaccaaatcg 
gaatgtccat tttgcacata ccagtatatt 
atatgattta cagatctatc ctcaggagag 
acattatgaa ctgcatcttt tgcactacct 
tctaaaagtg atatgagcgg ccgcgagctc 

<210> 3 

<211> 863 

<212> PRT 

<213> Homo sapiens 

<4 00> 3 


tgtgctgttc atatatggtg gtcctcaggt 
caagtatttc cgcttgaata ccctagcctc 
caggggatcc tgtcaccgag ggcttaaatt 
aatagaaatt gacgatcagg tggaaggact 
tgacttagat cgtgtgggca tccacggctg 
attaatgcag aggtcagata tcttcagggt 
gatcttctat gatacaggat acacggaacg 
gggctattac ttaggatctg tggccatgca 
tttactgctc ttacatggtt tcctggatga 
actgagtttt ttagtgaggg ctggaaagcc 
acacagcata agagttcctg aatcgggaga 
tcaagaaaac cttggatcac gtattgctgc 
c 


Met Ala Thr Thr Gly Thr Pro Thr 

1 5 

Thr Asp Asp Pro Ala Ala Arg Phe 

20 

Gly Leu Arg Ser lie lie His Gly 
35 40 

Val Asn Lys Ala Pro His Asp Phe 
50 55 

Ser Gly Pro His Ser His Arg Leu 
65 70 

Ser Arg Glu Asn Ser Leu Leu Tyr 

85 

Lys Glu Ala Leu Leu Leu Leu Ser 

100 

Gin Ala Thr Pro His His Gly Val 
115 120 

Arg Glu Arg Lys Arg Leu Gly Val 
130 135 

His Ser Glu Ser Gly Leu Phe Leu 
145 150 

His Cys Arg Asp Gly Gly Lys Asn 

165 

Pro Leu Glu lie Lys Thr Gin Cys 

180 

lie Cys Pro Ala Asp Pro Ala Phe 
195 200 

Leu Trp Val Ala Asn lie Glu Thr 
210 215 

Cys His Gin Gly Leu Ser Asn Val 
225 230 

Val Ala Thr Phe Val lie Gin Glu 

245 

Trp Trp Cys Pro Thr Ala Ser Trp 

260 

Leu Arg lie Leu Tyr Glu Glu Val 
275 280 

His Val Pro Ser Pro Ala Leu Glu 
290 295 

Tyr Pro Arg Thr Gly Ser Lys Asn 
305 310 


Ala Asp Arg Gly Asp Ala Ala Ala 
10 15 

Gin Val Gin Lys His Ser Trp Asp 
25 30 

Ser Arg Lys Tyr Ser Gly Leu He 

45 

Gin Phe Val Gin Lys Thr Asp Glu 

60 

Tyr Tyr Leu Gly Met Pro Tyr Gly 
75 80 

Ser Glu lie Pro Lys Lys Val Arg 
90 95 

Trp Lys Gin Met Leu Asp His Phe 
105 110 

Tyr Ser Arg Glu Glu Glu Leu Leu 

125 

Phe Gly lie Thr Ser Tyr Asp Phe 

14 0 

Phe Gin Ala Ser Asn Ser Leu Phe 
155 160 

Gly Phe Met Val Ser Pro Met Lys 
170 175 

Ser Gly Pro Arg Met Asp Pro Lys 
185 190 

Phe Ser Phe lie Asn Asn Ser Asp 

205 

Gly Glu Glu Arg Arg Leu Thr Phe 

220 

Leu Asp Asp Pro Lys Ser Ala Gly 
235 240 

Glu Phe Asp Arg Phe Thr Gly Tyr 
250 255 

Glu Gly Ser Glu Gly Leu Lys Thr 
265 270 

Asp Glu Ser Glu Val Glu Val He 

285 

Glu Arg Lys Thr Asp Ser Tyr Arg 

300 

Pro Lys lie Ala Leu Lys Leu Ala 
315 320 


1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2671 


6 


WO 02/31134 PCT/TJS01/31874 


GlU 

Phe 

Gin 

Thr 

Asp 

Ser 

Gin 

Gly 

Lys 

lie 

val 

Ser 

Thr 

Gin 

Glu 

Lys 





325 





330 





335 


Glu 

Leu 

Val 

Gin 

Pro 

Phe 

Ser 

Ser 

Leu 

Phe 

Pro 

Lys 

Val 

Glu 

Tyr 

lie 




340 





345 





350 



Ala 

Arg 

Ala 

Gly 

Trp 

Thr 

Arg 

Asp 

Gly 

Lys 

Tyr 

Ala 

Trp 

Ala 

Met 

Phe 



355 





360 





365 




Leu 

Asp 

Arg 

Pro 

Gin 

Gin 

Trp 

Leu 

Gin 

Leu 

Val 

Leu 

Leu 

Pro 

Pro 

Ala 


370 





375 





380 





Leu 

Phe 

He 

Pro 

Ser 

Thr 

Glu 

Asn 

Glu 

Glu 

Gin 

Arg 

Leu 

Ala 

Ser 

Ala 

385 





390 





395 





400 

Arg 

Ala 

Val 

Pro 

Arg 

Asn 

Val 

Gin 

Pro 

Tyr 

Val 

Val 

Tyr 

Glu 

Glu 

Val 





405 





410 





415 


Thr 

Asn 

Val 

Trp 

lie 

Asn 

Val 

His 

Asp 

lie 

Phe 

Tyr 

Pro 

Phe 

Pro 

Gin 




420 





425 





430 



Ser 

Glu 

Gly 

Glu 

Asp 

Glu 

Leu 

Cys 

Phe 

Leu 

Arg 

Ala 

Asn 

Glu 

Cys 

Lys 



435 





440 





445 




Thr 

Gly 

Phe 

Cys 

His 

Leu 

Tyr 

Lys 

Val 

Thr 

Ala 

Val 

Leu 

Lys 

Ser 

Gin 


450 





455 





460 





Gly 

Tyr 

Asp 

Trp 

Ser 

Glu 

Pro 

Phe 

Ser 

Pro 

Gly 

Glu 

Asp 

Glu 

Phe 

Lys 

465 





470 





475 





480 

Cys 

Pro 

He 

Lys 

GlU 

Glu 

lie 

Ala 

Leu 

Thr 

Ser 

Gly 

Glu 

Trp 

Glu 

Val 





485 





490 





495 


Leu 

Ala 

Arg 

His 

Gly 

Ser 

Lys 

lie 

Trp 

Val 

Asn 

Glu 

Glu 

Thr 

Lys 

Leu 




500 





505 





510 



Val 

Tyr 

Phe 

Gin 

Gly 

Thr 

Lys 

Asp 

Thr 

Pro 

Leu 

GlU 

His 

His 

Leu 

Tyr 



515 





520 





525 




Val 

Val 

Ser 

Tyr 

Glu 

Ala 

Ala 

Gly 

Glu 

lie 

Val 

Arg 

Leu 

Thr 

Thr 

Pro 


530 





535 





540 





Gly 

Phe 

Ser 

His 

Ser 

Cys 

Ser 

Met 

Ser 

Gin 

Asn 

Phe 

Asp 

Met 

Phe 

Val 

545 





550 





555 





560 

Ser 

His 

Tyr 

ser 

Ser 

Val 

Ser 

Thr 

Pro 

Pro 

Cys 

Val 

His 

Val 

i 

Tyr 

Lys 





565 





570 





575 


Leu 

Ser 

Gly 

Pro 

Asp 

Asp 

Asp 

Pro 

Leu 

His 

Lys 

Gin 

Pro 

Arg 

Phe 

Trp 




580 





585 





590 



Ala 

Ser 

Met 

Met 

Glu 

Ala 

Ala 

Ser 

Cys 

Pro 

Pro 

Asp 

Tyr 

Val 

Pro 

Pro 



595 





600 





605 




Glu 

lie 

Phe 

His 

Phe 

His 

Thr 

Arg 

Ser 

Asp 

Val 

Arg 

Leu 

Tyr 

Gly 

Met 


610 





615 





620 





He 

Tyr 

Lys 

Pro 

His 

Ala 

Leu 

Gin 

Pro 

Gly 

Lys 

Lys 

His 

Pro 

Thr 

Val 

625 





630 





635 





640 

Leu 

Phe 

val 

Tyr 

Gly 

Gly 

Pro 

Gin 

Val 

Gin 

Leu 

Val 

Asn 

Asn 

Ser 

Phe 





645 





650 





655 


Lys 

Gly 

lie 

Lys 

Tyr 

Leu 

Arg 

Leu 

Asn 

Thr 

Leu 

Ala 

Ser 

Leu 

Gly 

Tyr 




660 





665 





670 



Ala 

Val 

Val 

Val 

lie 

Asp 

Gly 

Arg 

Gly 

Ser 

Cys 

Gin 

Arg 

Gly 

Leu 

Arg 



675 





6 80 





665 




Phe 

Glu 

Gly 

Ala 

Leu 

Lys 

Asn 

Gin 

Met 

Gly 

Gin 

Val 

Glu 

lie 

Glu 

Asp 


690 





695 





700 





Gin 

Val 

Glu 

Gly 

Leu 

Gin 

Phe 

Val 

Ala 

Glu 

Lys 

Tyr 

Gly 

Phe 

lie 

Asp 

705 





710 





715 





720 

Leu 

Ser 

Arg 

Val 

Ala 

lie 

His 

Gly 

Trp 

Ser 

Tyr 

Gly 

Gly 

Phe 

Leu 

Ser 





725 





730 





735 


Leu 

Met 

Gly 

Leu 

lie 

His 

Lys 

Pro 

Gin 

Val 

Phe 

Lys 

Val 

Ala 

lie 

Ala 




740 





745 





750 



Gly 

Ala 

Pro 

Val 

Thr 

Val 

Trp 

Met 

Ala 

Tyr 

Asp 

Thr 

Gly 

Tyr 

Thr 

Glu 



755 





760 





765 
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Arg Tyx Met Asp Val Pro Glu Asn Asn Gin His Gly Tyr Glu Ala Gly 
770 775 700 

Ser Val Ala Leu His Val Glu Lys Leu Pro Asn Glu Pro Asn Arg Leu 

70S 790 795 800 

Leu lie Leu His Gly Phe Leu Asp Glu Asn Val His Phe Phe His Thr 

805 810 815 

Asn Phe Leu Val Ser Gin Leu lie Arg Ala Gly Lys Pro Tyr Gin Leu 

820 825 830 

Gin lie Tyr Pro Asn Glu Arg His Ser He Arg Cys Pro Glu Ser Gly 
835 840 845 

Glu His Tyr Glu Val Thr Leu Leu His Phe Leu Gin Glu Tyr Leu 
850 855 860 

<210> 4 

<211> 2617 

<212> DNA 

<213> Homo sapiens 

<400> 4 

caagcttacc atggccacca ccgggacccc aacggccgac cgaggcgacg cagcccrccac 60 

agatgacccg gccgcccgct tccaggtgca gaagcactcg tgggacgggc tccggagcat 120 

catccacggc agccgcaagt actcgggcct cattgtcaac aaggcgcccc acgacCtcca 180 

gttCgtgcag aagacggatg agtctgggcc ccactcccac cgcctctact acctgggaat 240 

gccatatggc agccgagaga actccctcct ctactctgag attcccaaga aggtccggaa 300 

agaggctctg ctgctcctgt cctggaagca gatgctggat catttccagg ccacgcccca 360 

ccatggggtc tactctcggg aggaggagct gctgagggag cggaaacgcc tgggggtctt 420 

cggcatcacc tcctacgact tccacagcga gagtggcctc ttcctcttcc aggccagcaa 480 

cagcctcttc cactgtcgcg acggcggcaa gaacggcttc atggtgtccc ctatgaaacc 540 

gctggaaatc aagacccagt gctcagggcc ccggatggac cccaaaatct gccctgccga 600 

ccctgccttc ttctccttca tcaataacag cgacctgtgg gtggccaaca tcgagacagg 660 

cgaggagcgg cggctgacct tctgccacca aggtttatcc aatgtcctgg atgaccccaa 720 

gtctgcgggt gtggccacct tcgtcataca ggaagagttc gaccgcttca ctgggtactg 780 

gtggtgcccc acagcctcct gggaaggttc agagggcctc aagacgctgc gaatcctgta 840 

tgaggaagtc gatgagtccg aggtggaggt cattcacgtc ccctctcctg cgctagaaga 900 

aaggaagacg gactcgtatc ggtaccccag gacaggcagc aagaatccca agattgcctt 960 

gaaactggct gagttccaga ctgacagcca gggcaagatc gtctcgaccc aggagaagga 1020 

gctggtgcag cccttcagct cgctgttccc gaaggtggag tacatcgcca gggccgggtg 1080 

gacccgggat ggcaaatacg cctgggccat gttcctggac cggccccagc agtggctcca 1140 

gctcgtcctc ctccccccgg ccct 9 ttcat cccgagcaca gagaatgagg agcagcggct 1200 

agcctctgcc agagctgtcc ccaggaatgt ccagccgtat gtggtgtacg aggaggtcac 1260 

caacgtctgg atcaatgttc atgacatctt ctatcccttc ccccaatcag agggagagga 1320 

cgagctctgc tttctccgcg ccaatgaatg caagaccggc ttctgccatt tgtacaaagt 1380 

caccgccgtt ttsaa&tccc agggctacga ttggagtgag cccttcagcc ccggggaaga 1440 

tgaatttaag tgccccatta aggaagagat tgctctgacc agcggtgaat gggaggtttt isoo 

ggcgaggcac ggctccaaga tctgggtcaa tgaggagacc aagctggtgt acttccaggg 1560 

caccaaggac acgccgctgg agcaccacct ctacgtggtc agctatgagg cggccggcga 1620 

gatcgtacgc ctcaccacgc ccggcttctc ccatagctgc tccatgagcc agaacttcga 1680 

catgttcgtc agccactaca gcagcgtgag cacgccgccc tgcgtgcacg tctacaagct 1740 

gagcggcccc gacgacgacc ccctgcacaa gcagccccgc ttctgggcta gcatgatgga 1800 

ggcagccagc tgccccccgg attatgttcc tccagagatc ttccatttcc acacgcgctc i860 

ggatgtgcgg ctctacggca tgatctacaa gccccacgcc ttgcagccag ggaagaagca 1920 

ccccaccgtc ctctttgtat atggaggccc ccaggtgcag ctggtgaata actccttcaa 1980 

aggcatcaag tacttgcggc tcaacacact ggcctccctg ggctacgccg tggttgtgat 2040 

tgacggcagg ggctcctgtc agcgagggct tcggttcgaa ggggccctga aaaaccaaat 2100 

gggccaggtg gagatcgagg accaggtgga gggcctgcag ttcgtggccg agaagtatgg 2160- 

cttcatcgac ctgagccgag ttgccatcca tggctggtcc tacgggggct tcctctcgct 2220 

catggggcta atccacaagc cccaggtgtt caaggtggcc atcgcgggtg ccccggtcac 2280 

cgtctggatg gcctacgaca cagggtacac tgagcgctac atggacgtcc ctgagaacaa 2340 

ccagcacggc tatgaggcgg gttccgtggc cctgcacgtg gagaagctgc ccaatgagcc 2400 

caaccgcttg cttatcctcc acggcttcct ggacgaaaac gtgcactttt tccacacaaa 2460 
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cttcctcgtc tcccaactga tccgagcagg gaaaccttac cagctccaga tctaccccaa 

cgagagacac agtattcgct gccccgagtc gggcgagcac tatgaagtca cgttgctgca 

ctttctacag gaatacctct gagcggccgc ggatccg 

<210> 5 

<211> 796 

<212> PRT 

<213 > Homo sapiens 

<400> 5 


Met 

Asn 

Gin 

Thr 

Ala 

Ser 

Val 

Ser 

His 

His 

lie 

Lys 

Cys 

Gin 

Pro 

Ser 

1 




5 





10 





15 


Lys 

Thr 

lie 

Lys 

Glu 

Leu 

Gly 

Ser 

Asn 

Ser 

Pro 

Pro 

Gin 

Arg 

Asn 

Trp 




20 





25 





30 



Lys 

Gly 

lie 

Ala 

He 

Ala 

Leu 

Leu 

Val 

lie 

Leu 

Val 

Val 

Cys 

Ser 

Leu 



35 





40 





45 




He 

Thr 

Met 

Ser 

Val 

lie 

Leu 

Leu 

Ser 

Pro 

Asp 

Glu 

Leu 

Thr 

Asn 

Ser 


50 





55 





60 





Ser 

GlU 

Thr 

Arg 

Leu 

Ser 

Leu 

Glu 

Asp 

Leu 

Phe 

Arg 

Lys 

Asp 

Phe 

Val 

6 5 





70 





75 





80 

Leu 

His 

Asp 

Pro^Glu 

Ala 

Arg 

Trp 

lie 

Asn 

Asp 

Thr 

Asp 

Val 

Val 

Tyr 





85 





90 





95 


Lys 

Ser 

Glu 

Asn 

Gly 

His 

Val 

lie 

Lys 

Leu 

Asn 

lie 

GlU 

Thr 

Asn 

Ala 




100 





105 





110 



Thr 

Thr 

Leu 

Leu 

Leu 

Glu 

Asn 

Thr 

Thr 

Phe 

Val 

Thr 

Phe 

Lys 

Ala 

Ser 



115 





120 





125 




Arg 

His 

Ser 

val 

Ser 

Pro 

Asp 

Leu 

Lys 

Tyr 

Val 

Leu 

Leu 

Ala 

Tyr 

Asp 


130 





135 





140 





Val 

Lys 

Gin 

lie 

Phe 

His 

Tyr 

Ser 

Tyr 

Thr 

Ala 

Ser 

Tyr 

val 

lie 

Tyr 

145 





150 





155 





160 

Asn 

lie 

His 

Thr 

Arg 

Glu 

Val 

Trp 

Glu 

Leu 

Asn 

Pro 

Pro 

Glu 

Val 

Glu 





165 





170 





175 


Asp 

Ser 

Val 

Leu 

Gin 

Tyr 

Ala 

Ala 

Trp 

Gly 

Val 

Gin 

Gly 

Gin 

Gin 

Leu 




180 





185 





190 



lie 

Tyr 

lie 

Phe 

Glu 

Asn 

Asn 

lie 

Tyr 

Tyr 

Gin 

Pro 

Asp 

lie 

Lys 

Ser 



195 





200 





205 




Ser 

Ser 

Leu 

Arg 

Leu 

Thr 

Ser 

Ser 

Gly 

Lys 

Glu 

Glu 

lie 

lie 

Phe 

Asn 


210 





215 





220 





Gly 

lie 

Ala 

Asp 

Trp 

Leu 

Tyr 

Glu 

Glu 

Glu 

Leu 

Leu 

His 

Ser 

His 

lie 

225 





230 





235 





240 

Ala 

His 

Trp 

Trp 

Ser 

Pro 

Asp 

Gly 

Glu 

Arg 

Leu 

Ala 

Phe 

Leu 

Met 

lie 





245 





250 





255 


Asn 

ASp 

Ser 

Leu 

Val 

Pro 

Thr 

Met 

Val 

lie 

Pro 

Arg 

Phe 

Thr 

Gly 

Ala 




260 





265 





270 



Leu 

Tyr 

Pro 

Lys 

Gly 

Lys 

Gin 

Tyr 

Pro 

Tyr 

Pro 

Lys 

Ala 

Gly 

Gin 

val 



275 





280 





285 




Asn 

Pro 

Thr 

lie 

Lys 

Leu 

Tyr 

Val 

Val 

Asn 

Leu 

Tyr 

Gly 

Pro 

Thr 

His 


290 





295 





300 





Thr 

Leu 

Glu 

Leu 

Met 

Pro 

Pro 

Asp 

Ser 

Phe 

Lys 

Ser 

Arg 

Glu 

Tyr 

Tyr 

305 





310 





315 





320 

lie 

Thr 

Met 

Val 

Lys 

Trp 

Val 

Ser 

Asn 

Thr 

Lys 

Thr 

Val 

Val 

Arg 

Trp 





325 





330 





335 


Leu 

Asn 

Arg 

Pro 

Gin 

Asn 

lie 

Ser 

lie 

Leu 

Thr 

Val 

Cys 

Glu 

Thr 

Thr 




340 





345 





350 



Thr 

Gly 

Ala 

Cys 

Ser 

Lys 

Lys 

Tyr 

Glu 

Met 

Thr 

Ser 

Asp 

Thr 

Trp 

Leu 



355 





360 





365 




Ser 

Gin 

Gin 

Asn 

Glu 

Glu 

Pro 

Val 

Phe 

Ser 

Arg 

Asp 

Gly 

Ser 

Lys 

Phe 


370 





375 





380 





Phe 

Met 

Thr 

Val 

Pro 

val 

Lys 

Gin 

Gly 

Gly 

Arg 

Gly 

Glu 

Phe 

His 

His 

385 





390 





395 





400 


2520 

2S60 

2617 
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He 

Ala 

Met 

Phe 

Leu 

lie 

Gin 

Ser 

Lys 

Ser 

Glu 

Gin 

lie 

Thr 

Val 

Arg 





405 





410 





415 


His 

Leu 

Thr 

Ser 

Gly 

Asn 

Trp 

Glu 

Val 

He 

Lys 

lie 

Leu 

Ala 

Tyr 

Asp 




420 





425 





430 



Glu 

Thr 

Thr 

Gin 

Lys 

lie 

Tyr 

Phe 

Leu 

Ser 

Thr 

Glu 

Ser 

Ser 

Pro 

Arg 



435 





440 





445 




Gly 

Arg 

Gin 

Leu 

Tyr 

Ser 

Ala 

Ser 

Thr 

GlU 

Gly 

Leu 

Leu 

Asn 

Arg 

Gin 


450 





455 





460 





Cys 

lie 

Ser 

Cys 

Asn 

Phe 

Met 

Lys 

Glu 

Gin 

Cys 

Thr 

Tyr 

Phe 

Asp 

Ala 

465 





470 





475 





480 

ser 

Phe 

Ser 

Pro 

Met 

Asn 

Gin 

His 

Phe 

Leu 

Leu 

Phe 

Cys 

Glu 

Gly 

Pro 





485 





490 





495 


Arg 

Val 

Pro 

Val 

val 

Ser 

Leu 

His 

Ser 

Thr 

Asp 

Asn 

Pro 

Ala 

Lys 

Tyr 




500 





505 





510 



Phe 

lie 

Leu 

Glu 

Ser 

Asn 

Ser 

Met 

Leu 

Lys 

Glu 

Ala 

He 

Leu 

Lys 

Lys 



515 





520 





525 




Lys 

lie 

Gly 

Lys 

Pro 

Glu 

lie 

Lys 

He 

Leu 

His 

lie 

Asp 

Asp 

Tyr 

Glu 


530 





535 





540 





Leu 

Pro 

Leu 

Gin 

Leu 

Ser 

Leu 

Pro 

Lys 

Asp 

Phe 

Met 

Asp 

Arg 

Asn 

Gin 

^ ^ 





CCA 

V 





SEE 





560 

Tyr 

Ala 

Leu 

Leir 

Leu 

lie 

Met 

Asp 

Glu 

Glu 

Pro 

Gly 

Gly 

Gin 

Leu 

Val 




565 





570 





575 


Thr 

Asp 

Lys 

Phe 

His 

He 

Asp 

Trp 

Asp 

Ser 

Val 

Leu 

lie 

Asp 

Met 

Asp 




580 





585 





590 



Asn 

Val 

lie 

val 

Ala 

Arg 

Phe 

Asp 

Gly 

Arg 

Gly 

Ser 

Gly 

Phe 

Gin 

Gly 



595 





600 





605 




Leu 

Lys 

lie 

Leu 

Gin 

GlU 

lie 

His 

Arg 

Arg 

Leu 

Gly 

Ser 

val 

Glu 

Val 


610 





615 





620 





Lys 

Asp 

Gin 

lie 

Thr 

Ala 

Val 

Lys 

Phe 

Leu 

Leu 

Lys 

Leu 

Pro 

Tyr 

lie 

625 





630 





635 





640 

Asp 

Ser 

Lys 

Arg 

Leu 

Ser 

lie 

Phe 

Gly 

Lys 

Gly 

Tyr 

Gly 

Gly 

Tyr 

lie 





645 





650 





655 


Ala 

Ser 

Met 

lie 

Leu 

Lys 

Ser 

Asp 

Glu 

Lys 

Leu 

Phe 

Lys 

Cys 

Gly 

Ser 




660 





665 





670 



Val 

val 

Ala 

Pro 

lie 

Thr 

Asp 

Leu 

Lys 

Leu 

Tyr 

Ala 

Ser 

Ala 

Phe 

Ser 



675 





680 





685 




Glu 

Arg 

Tyr 

Leu 

Gly 

Met 

Pro 

Ser 

Lys 

Glu 

Glu 

Ser 

Thr 

Tyr 

Gin 

Ala 


690 





695 





700 





Ala 

Ser 

Val 

Leu 

His 

Asn 

Val 

His 

Gly 

Leu 

Lys 

Glu 

Glu 

Asn 

lie 

Leu 

705 





710 





715 





720 

lie 

lie 

His 

Gly Thr 

Ala 

Asp 

Thr 

Lys 

Val 

His 

Phe 

Gin 

His 

Ser 

Ala 





725 





730 





735 


Glu 

Leu 

lie 

Lys 

His 

Leu 

lie 

Lys 

Ala 

Gly 

Val 

Asn 

Tyr 

Thr 

Met 

Gin 




740 





745 





750 



Val 

Tyr 

Pro 

Asp 

Glu 

Gly 

His 

Asn 

Val 

Ser 

Glu 

Lys 

Ser 

Lys 

Tyr 

His 



755 





760 





765 




Leu 

Tyr 

Ser 

Thr 

lie 

Leu 

Lys 

Phe 

Phe 

Ser 

Asp 

Cys 

Leu 

Lys 

Glu 

Glu 


770 





775 





780 





lie 

ser 

Val 

Leu 

Pro 

Gin 

Glu 

Pro 

Glu 

Glu 

Asp 

Glu 





785 





790 





795 







< 210 > 6 

<211> 2583 

<212 > DNA 

<213> Homo sapiens 

<4 0 0 > 6 

gcctgggatt gtgcactgtc cagggtcctg aaacatgaac caaactgcca gcgtgtccca 
tcacatcaag tgtcaaccct caaaaacaat caaggaactg ggaagtaaca gccctccaca 
gagaaactgg aagggaattg ctattgctct gctggtgatt ttagttgtat gctcactcat 


60 

120 

180 
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cactatgtca 

gtctttggaa 

caatgataca 

aacaaatgct 

acattcagtt 

tcattattcg 

gttaaatcct 

gcagcagctg 

ttcattgcga 

gttatatgaa 

aagacttgcc 

tactggagcg 

cccaacaata 

gccacctgac 

taccaagact 

tgagaccact 

tcagcagaat 

tgttaagcaa 

aagtgagcaa 

gacafcacgat 

aaggcagctg 

tttcatgaaa 

cttattattc 

agcaaaatat 

gataggaaag 

gtcccttccc 

agaaccagga 

tgacatggat 

gaaaattttg 

agctgtgaaa 

aaagggttat 

atgtggatcc 

aagatacctt 

taatgttcat 

agttcatttc 

tactatgcag 

ctacagcaca 

acaggaacca 

aggctcaatg 

cagcttacgg 

cac 


gtcatcctct 

gacctcttta 

gatgtggtgt 

accacattat 

tcaccagatt 

tatactgctt 

ccagaagtag 

atttatattt 

ctgacatctt 

gaggaactcc 

ttcctgatga 

ttgtatccca 

aaattatatg 

agctttaaat 

gtggtaagat 

acaggtgctt 

gaggagcccg 

gggggacgtg 

attaccgtgc 

gaaactactc 

tacagtgctt 

gaacaatgta 

tgtgaaggtc 

tttatattgg 

ccagaaatta 

aaagatttta 

ggccagctgg 

aatgtcattg 

caggagattc 

tttttgctga 

ggtggctata 

gtggttgcac 

gggatgccat 

ggcttgaaag 

caacactcag 

gtctacccag 

atcctcaaat 

gaagaagatg 

aaacctgaca 

agatgtcact 


taagcccaga 

ggaaagactt 

ataaaagcga 

tattggaaaa 

taaaatatgt 

catatgtgat 

aggactccgt 

ttgaaaataa 

ctggaaaaga 

tgcattctca 

taaatgactc 

aaggaaagca 

ttgtaaacct 

caagagaata 

ggttaaaccg 

gtagtaaaaa 

tgttttctag 

gagaatttca 

ggcatctgac 

aaaaaattta 

ctactgaagg 

catattttga 

caagggtccc 

aaagcaattc 

aaatccttca 

tggaccgaaa 

ttacagataa 

tagcaagatt 

atcgaagatt 

aactgcctta 

ttgcatcaat 

ctatcacaga 

ctaaggaaga 

aagaaaatat 

cagaattaat 

atgaaggtca 

tcttcagtga 

aataatggac 

aagagactgt 

ggagcagcac 


tgaactcaca aattcgtcag 
tgtgcttcac gatccagagg 
gaatggacat gtcattaaac 
cacaactttt gtaaccttca 
ccttctggca tatgatgtca 
ttacaacata cacactaggg 
cttgcagtac gcggcctggg 
tatctactat caacctgata 
agaaataatt tttaatggga 
catcgcccac tggtggtcac 
tttggtaccc accatggtta 
gtatccgtat cctaaggcag 
gtatggacca actcacactt 
ctatatcact atggttaaat 
acctcagaac atctccatcc 
atatgagatg acatcagata 
agacggcagc aaattcttta 
ccacatagct atgttcctca 
atcaggaaac tgggaagtga 
ctttctgagc actgaatctt 
attattgaat cgccaatgca 
tgccagtttt agtcccatga 
agtggtcagc ctacatagta 
tatgctgaag gaagctatcc 
tattgacgac tatgaacttc 
ccagtatgct cttctgttaa 
gttccatatt gactgggatt 
tgatggcaga ggaagtggat 
aggttcagta gaagtaaagg 
cattgactcc aaaagattaa 
gatcttaaaa tcagatgaaa 
cttgaaattg tatgcctcag 
aagcacttac caggcagcca 
attaataatt catggaactg 
caagcaccta ataaaagctg 
taacgtatct gagaagagca 
ttgtttgaag gaagaaatat 
cgtatttata cagaactgaa 
aatattgtag ttgctccaga 
gctcagagac agtgaactag 


aaaccagatt 

240 

ctcggtggat 

300 

tgaatataga 

360 

aagcatcaag 

420 

aacagatttt 

480 

aagtttggga 

540 

gtgtccaagg 

600 

taaagagcag 

660 

ttgctgactg 

720 

cagatggaga 

780 

tccctcggtt 

840 

gtcaagtgaa 

900 

tggagctcat 

960 

gggtaagcaa 

1020 

tcacagtctg 

1080 

cgtggctctc 

1140 

tgacagtgcc 

1200 

tccagagtaa 

1260 

taaagatctt 

1320 

ctcccagagg 

1380 

tttcatgtaa 

1440 

atcaacattt 

1500 

cggacaaccc 

1560 

tgaagaagaa 

1620 

ctttacagtt 

1680 

taatggatga 

1740 

ccgtactcat 

1800 

tccagggtct 

1860 

accaaataac 

1920 

gcatttttgg 

1980 

agctttttaa 

2040 

ctttctctga 

2100 

gtgtgctaca 

2160 

ctgacacaaa 

2220 

gagtgaatta 

2280 

agtatcatct 

2340 

ctgtgctacc 

2400 

gggaatattg 

2460 

atgtcaaggg 

2520 

catttgaata 

2580 

2583 


<210> 7 

<211> 690 

<212> PRT 

<213> Homo sapiens 

<400> 7 


Met 

1 

Ala 

Ala 

Ala 

Met 

5 

Glu 

Thr 

Glu 

Thr 

Ala 

Asp 

Cys 

20 

GlU 

Glu 

Asn 

lie 

GlU 

Pro 

Phe 

35 

Tyr 

Val 

Glu 

Arg 

Tyr 

40 

Leu 

Ala 

50 

Asp 

Thr 

Arg 

Lys 

Tyr 

55 

His 

His 

65 

Asp 

Phe 

Met 

Phe 

val 

70 

Lys 

Arg 

Asp 

Arg 

He 

Tyr 

Tyr 

85 

Leu 

Ala 

Met 


Gin Leu Gly Val Glu lie Phe Glu 
10 15 

Glu Ser Gin Asp Arg Pro Lys Leu 
25 30 

Ser Trp Ser Gin Leu Lys Lys Leu 

45 

Gly Tyr Met Met Ala Lys Ala Pro 

60 

Asn Asp Pro Asp Gly Pro His Ser 
75 80 

Ser Gly Glu Asn Arg Glu Asn Thr 
90 95 
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Leu Phe Tyr Ser Glu He Pro Lys Thr lie Asn Arg Ala Ala Val Leu 

100 105 110 

Met Leu Ser Trp Lys Pro Leu Leu Asp Leu Phe Gin Ala Thr Leu Asp 
H5 120 125 

Tyr Gly Met Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg Lys Arg 
130 135 140 


lie Gly Thr Val Gly lie Ala Ser 

145 150 

Thr Phe Leu Phe Gin Ala Gly Ser 

165 

Gly Pro Gin Gly Phe Thr Gin Gin 

180 

Thr Ser Cys Pro Asn lie Arg Met 
195 200 

Pro Asp Trp lie Ala Phe lie His 
210 215 
He Val Thr Arg Glu Glu Arg Arg 
225 230 

Ala Asn Met Glu Glu Asp Ala Arg 

245 

Leu Gin Glu Glu^Ehe Asp Arg Tyr 

260 

Ala Glu Thr Thr Pro Ser Gly Gly 
275 280 

Glu Asn Asp Glu Ser Glu Val Glu 
290 295 

Leu Glu Thr Arg Arg Ala Asp Ser 
305 310 

Ala Asn Pro Lys Val Thr Phe Lys 

325 

Glu Gly Arg lie lie Asp Val lie 

340 

Glu lie Leu Phe Glu Gly Val Glu 
355 360 

Pro Glu Gly Lys Tyr Ala Trp Ser 
370 375 

Arg Leu Gin lie Val Leu lie Ser 
305 390 

Asp Asp Val Met Glu Arg Gin Arg 

405 

Val Thr Pro Leu lie lie Tyr Glu 

420 

lie His Asp He Phe His Val Phe 
435 440 

Glu Phe lie Phe Ala Ser Glu Cys 
450 455 

Lys lie Thr Ser lie Leu Lys Glu 
465 470 

Gly Leu Pro Ala Pro Ser Asp Phe 

485 

Ala lie Thr Ser Gly Glu Trp Glu 

500 

lie Gin Val Asp Glu Val Arg Arg 
515 520 

Asp Ser Pro Leu Glu His His Leu 
530 535 

Gly Glu Val Thr Arg Leu Thr Asp 
545 550 

lie Ser Gin His Cys Asp Phe Phe 

565 


Tyr Asp Tyr His Gin Gly Ser Gly 
155 160 

Gly lie Tyr His Val Lys Asp Gly 
170 175 

Pro Leu Arg Pro Asn Leu Val Glu 
185 190 

Asp Pro Lys Leu Cys Pro Ala Asp 

205 

Ser Asn Asp lie Trp lie Ser Asn 

220 

Leu Thr Tyr Val His Asn Glu Leu 
235 240 

Ser Ala Gly Val Ala Thr Phe Val 
250 255 

Ser Gly Tyr Trp Trp Cys Pro Lys 
265 270 

Lys lie Leu Arg lie Leu Tyr Glu 

285 

lie lie His Val Thr Ser Pro Met 

300 

Phe Arg Tyr Pro Lys Thr Gly Thr 
315 320 

Met Ser Glu lie Met lie Asp Ala 
330 335 

Asp Lys Glu Leu lie Gin Pro Phe 
345 350 

Tyr lie Ala Arg Ala Gly Trp Thr 

365 

lie Leu Leu Asp Arg Ser Gin Thr 

380 

Pro Glu Leu Phe lie Pro Val Glu 
395 400 

Leu lie Glu Ser Val Pro Asp Ser 
410 415 

Glu Thr Thr Asp lie Trp lie Asn 
425 430 

Pro Gin Ser His Glu Glu Glu lie 

445 

Lys Thr Gly Phe Arg His Leu Tyr 

460 

Ser Lys Tyr Lys Arg Ser Ser Gly 
475 480 

Lys Cys Pro lie Lys Glu Glu lie 
490 495 

Val Leu Gly Arg His Gly Ser Asn 
505 510 

Leu Val Tyr Phe Glu Gly Thr Lys 

525 

Tyr Val Val Ser Tyr Val Asn Pro 

540 

Arg Gly Tyr Ser His Ser Cys Cys 
555 560 

lie Ser Lys Tyr Ser Asn Gin Lys 
570 575 
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Asn 

Pro 

His 

Cys 

Val 

Ser 

Leu 

Tyr 

Lys 

Leu 

Ser 

Ser 

Pro 

Glu 

Asp 

Asp 




580 





565 





590 



Pro 

Thr 

Cys 

Lys 

Thr 

Lys 

Glu 

Phe 

Trp 

Ala 

Thr 

lie 

Leu 

Asp 

Ser 

Ala 



595 





600 





605 




Gly 

Pro 

Leu 

Pro 

Asp 

Tyr 

Thr 

Pro 

Pro 

Glu 

lie 

Phe 

Ser 

Phe 

Glu 

Ser 


610 





615 





620 





Thr 

Thr 

Gly 

Phe 

Thr 

Leu 

Tyr 

Gly 

Met 

Leu 

Tyr 

Lys 

Pro 

His 

Asp 

Leu 

625 





630 





635 





640 

Gin 

Pro 

Gly 

Lys 

Lye 

Tyr 

Pro 

Thr 

Val 

Leu 

Phe 

lie 

Tyr 

Gly 

Gly 

Arg 





645 





650 





655 


Leu 

Leu 

Leu 

Leu 

Gly 

Pro 

Gin 

Ser 

Leu 

Cys 

Gly 

Ser 

Ser 

Met 

lie 

Gin 




660 





665 





670 



Asp 

Thr 

Arg 

Asn 

Val 

He 

Trp 

Val 

Thr 

Leu 

Thr 

Arg 

Met 

Asn 

Arg 

Ala 



675 





680 





685 




lie 

Thr 
















690 
















< 210 > 8 

<211> 4523 

<212> DMA . _ 

<213> Homo sapiens 
<400> 8 

aagtgctaaa gcctccgagg ccaaggccgc tgctactgcc gccgctgctt cttagtgccg 60 

cgttcgccgc ctgggttgtc accggcgccg ccgccgagga agccactgca accaggaccg 120 

gagtggaggc ggcgcagcat gaagcggcgc aggcccgctc catagcgcac gtcgggacgg 180 

tccgggcggg gccgggggga aggaaaatgc aacatggcag cagcaatgga aacagaacag 240 

ctgggtgttg agatatttga aactgcggac tgtgaggaga atattgaatc acaggatcgg 300 

cctaaattgg agccttttta tgttgagcgg tattcctgga gtcagcttaa aaagctgctt 360 

gccgatacca gaaaatatca tggctacatg atggctaagg caccacatga tttcatgttt 420 

gtgaagagga atgatccaga tggacctcat tcagacagaa tctattacct tgccatgtct 480 

ggtgagaaca gagaaaatac actgttttat tctgaaattc ccaaaactat caatagagca 540 

gcagtcttaa tgctctcttg gaagcctctt ttggatcttt ttcaggcaac actggactat 600 

ggaatgtatt ctcgagaaga agaactatta agagaaagaa aacgcattgg aacagtcgga 660 

attgcttctt acgattatca ccaaggaagt ggaacatttc tgtttcaagc cggtagtgga 720 

atttatcacg taaaagatgg agggccacaa ggatttacgc aacaaccttt aaggcccaat 780 

ctagtggaaa ctagttgtcc caacatacgg atggatccaa aattatgccc tgctgatcca 840 

gactggattg cttttataca tagcaacgat atttggatat ctaacatcgt aaccagagaa 900 

gaaaggagac tcacttatgt gcacaatgag ctagccaaca tggaagaaga tgccagatca 960 

gctggagtcg ctacctttgt tctccaagaa gaatttgata gatattctgg ctattggtgg 1020 

tgtccaaaag ctgaaacaac tcccagtggt ggtaaaattc ttagaattct atatgaagaa 1080 

aatgatgaat ctgaggtgga aattattcat gttacatccc ctatgttgga aacaaggagg 1140 

gcagattcat tccgttatcc taaaacaggt acagcaaatc ctaaagtcac ttttaagatg 1200 

tcagaaataa tgattgatgc tgaaggaagg atcatagatg tcatagataa ggaactaatt 1260 

caaccttttg agattctatt tgaaggagtt gaatatattg ccagagctgg atggactcct 1320 

gagggaaaat atgcttggtc catcctacta gatcgctccc agactcgcct acagatagtg 1380 

ttgatctcac ctgaattatt tatcccagta gaagatgatg ttatggaaag gcagagactc 1440 

attgagtcag tgcctgattc tgtgacgcca ctaattatct atgaagaaac aacagacatc 1500 

tggataaata tccatgacat ctttcatgtt tttccccaaa gtcacgaaga ggaaattgag 1560 

tttatttttg cctctgaatg caaaacaggt ttccgtcatt tatacaaaat tacatctatt 1620 

ttaaaggaaa gcaaatataa acgatccagt ggtgggctgc ctgctccaag tgatttcaag 1680 

tgtcctatca aagaggagat agcaattacc agtggtgaat gggaagttct tggccggcat 1740 

ggatctaata tccaagttga tgaagtcaga aggctggtat attttgaagg caccaaagac 1800 

tcccctttag agcatcacct gtacgtagtc agttacgtaa atcctggaga ggtgacaagg 1860 

ctgactgacc gtggctactc acattcttgc tgcatcagtc agcactgtga cttctttata 1920 

agtaagtata gtaaccagaa gaatccacac tgtgtgtccc tttacaagct atcaagtcct 1980 

gaagatgacc caacttgcaa aacaaaggaa ttttgggcca ccattttgga ttcagcaggt 2040 

cctcttcctg actatactcc tccagaaatt ttctcttttg aaagtactac tggatttaca 2100 

ttgtatggga tgctctacaa gcctcatgat ctacagcctg gaaagaaata tcctactgtg 2160 
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ctgttcatat atggtggtcg gttgctattg ctggggcccc agtcactctg tggatcttct 
atgatacagg atacacggaa cgttatatgg gtcaccctga ccagaatgaa cagggctatt 
acttaggatc tgtggccatg caagcagaaa agttcccctc tgaaccaaat cgtttactgc 
tcttacatgg tttcctggat gagaatgtcc attttgcaca taccagtata ttactgagtt 
ttttagtgag ggctggaaag ccatatgatt tacagatcta tcctcaggag agacacagca 
taagagttcc tgaatcggga gaacattatg aactgcatct tttgcactac cttcaagaaa 
accttggatc acgtattgct gctctaaaag tgatataatt ttgacctgtg tagaactctc 
tggtatacac tggctattta accaaatgag gaggtttaat caacagaaaa cacagaattg 
atcatcacat tttgatacct gccatgtaac atctactcct gaaaataaat gtggtgccat 
gcaggggtct acggtttgtg gtagtaatct aataccttaa ccccacatgc tcaaaatcaa 
atgatacata ttcctgagag acccagcaat accataagaa ttactaaaaa aaaaaaaaaa 
aaaaagacat tagcaccatg tattcatact accctatttt cacttttaat agtattataa 
acttcatgaa cttaattagt gtatttttac agtatacttt tgagtttgtt aaaatatgat 
gatattagtg attggtttgg ttcagttcca gaatctttga ctagttacag atttgatagc 
acttaaatgt aattgaatag cttatgcttc attgcttggg catatccagc atgttatgaa 
ctaataacta ttaaacttga cttaaccagt cattcattaa taatttttca aggataactt 
agtggcctcc taaagacact tgttttggca ctgaccagtt tttagccaat ttaatctgta 
tctagtataa ataattctca tttttctttg atgatattaa cagagtgggc ttttcctttt 
gcataaaggc tagtaactgt atatgtagca tggatttaat tagtcatgat attgataatt 
acaggcagaa aatttttaat caaatgatta gagcttaaat atttgcaggc aagttttttt 
ttttecttta agaaaaggaa aaagtacaca ttcactagaa ttcttcagaa aatttagtgg 
tgccagtttc catttggtat ttccttatta aaatattcta gaattttaag gagattgaag 
ggaatcacag tggggtgggg agacctgggt ttggggaatg acagagagaa gaggtggtga 
gggcctgatt aaaaactaag cagaagtagt tttaacaaaa atactcatga aaatgtttgg 
aaactgaaat ttaaacaact gtaatattaa ggaaaccaga atcaataaat cactgtcttg 
ccagcacagc tacagagtaa catgattcag gggaggaaaa gttccttaga gttactttta 
taattctttt tttttttcct cttaggttta gaaatcttac aaatttaaac tttatccttt 
taaaattatt tgaacataat ttagatattg taagcttaaa atacaaatgt ttatagataa 
cctctttacc ataaactaat ccctggcaag ccatggctct cttttttttt ttggtgttta 
aagcctgtaa acagtttttc tgaatgatca tgaacttttc ttggtttagc actaggattt 
agctatgaag agagctcata ggctttcagg tgctaattga gatctgccct gttagagtct 
tggggtgcta gattggtcac attgacacca gtggcaggga aggcatctat gagtttgatg 
ctttttatca cacacttcag tgtttagaaa gttattacca atacttttaa acaacactcc 
aagaaaattt gctatatttc tttctcatca ctacagagag agtagatttc cccatagaga 
gcacagcctc cattagtaag gttggtgact attggtaaga ggtggacttc attgacacca 
agtgggaggt agggaaagcc cagaaatggc aggatgatat ggtggttctg tcgttgggaa 
aggtattggg ttttgctgtt tgtatttata ctgtataata gataccacgc tttttcttat 
tatctgtata tgtattgctt ttcatgtttg atattttccc atgccaagat ttgtttatat 
atattttcaa tgttaaatta aattgatttg ggtaactttc ttccccaaga aagtattttc 
ccccttaagt ataaatctga ctg 

< 210 > 9 

<211> 241 

<212> PRT 

<213> Homo sapiens 

<400> 9 


Met 

1 

Ala 

Ala 

Ala 

Met 

5 

Glu 

Thr 

Glu 

Gin 

Leu 

10 

Gly Val 

Glu 

lie 

Phe 

15 

Glu 

Thr 

Ala 

Asp 

cys 

20 

Glu 

Glu 

Asn 

He 

Glu 

25 

Ser 

Gin 

Asp 

Arg 

Pro 

30 

Lys 

Leu 

Glu 

Pro 

Phe 

35 

Tyr 

Val 

Glu 

Arg 

Tyr 

40 

Ser 

Trp 

Ser 

Gin 

Leu 

45 

Lys 

Lys 

Leu 

Leu 

Ala 

50 

Asp 

Thr 

Arg 

Lys 

Tyr 

55 

His 

Gly 

Tyr 

Met 

Met 

60 

Ala 

Lys 

Ala 

Pro 

His 

65 

Asp 

Phe 

Met 

Phe 

Val 

70 

Lys 

Arg 

Asn 

Asp 

Pro 

75 

Asp 

Gly 

Pro 

His 

Ser 

80 

Asp Arg 

He 

Tyr 

Tyr 

85 

Leu 

Ala 

Met 

Ser 

Gly 

90 

Glu 

Asn 

Arg 

Glu 

Asn 

95 

Thr 

Leu 

Phe 

Tyr 

Ser 

Glu 

lie 

Pro 

Lys 

Thr 

lie 

Asn 

Arg 

Ala 

Ala 

Val 

Leu 


2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4523 
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100 





105 





110 



Met 

Leu 

Ser 

Trp 

Lys 

Pro 

Leu 

Leu 

Asp 

Leu 

Phe 

Gin 

Ala 

Thr 

Leu 

Asp 



115 





120 





125 




Tyr 

Gly Met 

Tyr 

Ser 

Arg 

Glu 

Glu 

Glu 

Leu 

Leu 

Arg 

GlU 

Arg 

Lys 

Arg 


130 





135 





140 





lie 

Gly Thr 

Val 

Gly 

He 

Ala 

Ser 

Tyr 

Asp 

Tyr 

His 

Gin 

Gly 

Ser 

Gly 

145 





150 





155 





160 

Thr 

Phe 

Leu 

Phe 

Gin 

Ala 

Gly 

Ser 

Gly 

lie 

Tyr 

His 

Val 

Lys 

Asp 

Gly 





165 





170 





175 


Gly 

Pro 

Gin 

Gly 

Phe 

Thr 

Gin 

Gin 

Pro 

Leu 

Arg 

Pro 

Asn 

Leu 

Val 

Glu 




180 





185 





190 



Thr 

Ser 

Cys 

Pro 

Asn 

lie 

Arg 

Met 

Asp 

Pro 

Lys 

Leu 

Cys 

Pro 

Ala 

Asp 



195 





200 





205 




Pro 

Asp 

Trp 

lie 

Ala 

Phe 

lie 

His 

Ser 

Asn 

Asp 

lie 

Trp 

lie 

Ser 

Asn 


210 





215 





220 





He 

Val 

Thr 

Arg 

Glu 

Glu 

Arg 

Arg 

Leu 

Thr 

Tyr 

Val 

His 

Asn 

Gly 

Lys 

225 





230 





235 





240 


Ala 

< 210 > 10 

< 211 > 1356 

<212> DNA 

<213> Homo sapiens 

<400> 10 

aagtgctaaa gcctccgagg ccaaggccgc tgctactgcc gccgctgctt cttagtgccg 60 

cgttcgccgc ctgggttgtc accggcgccg ccgccgagga agccactgca accaggaccg 120 

gagtggaggc ggcgcagcat gaagcggcgc aggcccgctc catagcgcac gtcgggacgg 180 

tccgggcggg gccgggggga aggaaaatgc aacatggcag cagcaatgga aacagaacag 240 

ctgggtgttg agatatttga aactgcggac tgtgaggaga atattgaatc acaggatcgg 300 

cctaaattgg agccttttta tgttgagcgg tattcctgga gtcagcttaa aaagctgctt 360 

gccgatacca gaaaatatca tggctacatg atggctaagg caccacatga tttcatgttt 420 

gtgaagagga atgatccaga tggacctcat tcagacagaa tctattacct tgccatgtct 480 

ggtgagaaca gagaaaatac actgttttat tctgaaattc ccaaaactat caatagagca 540 

gcagtcttaa tgctctcttg gaagcctctt ttggatcttt ttcaggcaac actggactat 600 

ggaatgtatt ctcgagaaga agaactatta agagaaagaa aacgcattgg aacagtcgga 660 

attgcttctt acgattatca ccaaggaagt ggaacatttc tgtttcaagc cggtagtgga 720 

atttatcacg taaaagatgg agggccacaa ggatttacgc aacaaccttt aaggcccaat 780 

ctagtggaaa ctagttgtcc caacatacgg atggatccaa aattatgccc tgctgatcca 840 

gactggattg cttttataca tagcaacgat atttggatat ctaacatcgt aaccagagaa 900 

gaaaggagac tcacttatgt gcacaatggt aaggcgtagt tcttcagatt tacttttctg 960 

aacagtattt tttgaagtat aatttgctgc ttgcattttg aaattagatt accacgttgg 1020 

gtgatcttta tatttgaaat tcaagtcttt aaaattttta aaaaatggag aaaagtacag 1080 

aggataactt gtatgtacca catgtataat attcatttta atgttttaat gttcattttc 1140 

aaacagtgaa acaaaagaac ctctgacatg attgttcttt tagcttgcta agactgccag 1200 

aattttccca aaactgttct tattaaaata aaattttagg ctaggcatgg tggctcatgc 1260 

ctgtaatcct agcactctgg gaggctgagg caggcagatt gtttgagccc agaagttcaa 1320 

gatcaggatg ggcaacatgg tgacacctcg tttgac 1356 

< 210 > 11 

<21l> 661 

<212> PRT 

<213> Homo sapiens 

<400> 11 


Met 

1 

Ala 

Ala 

Ala 

Met 

5 

GlU 

Thr 

Glu 

Gin 

Leu Gly 
10 

Val 

Glu 

lie 

Phe 

15 

Glu 

Thr 

Ala 

Asp 

Cys 

20 

Glu 

GlU 

Asn 

lie 

Glu 

25 

Ser 

Gin 

Asp 

Arg 

Pro 

30 

Lys 

Leu 

Glu 

Pro 

Phe 

Tyr 

Val 

Glu 

Arg 

Tyr 

Ser 

Trp 

Ser 

Gin 

Leu 

Lys 

Lys 

Leu 


15 


' "■ il A ' 


WO 02/31134 




PCT/US01/31874 


35 40 

Leu Ala Asp Thr Arg Lys Tyr His 
50 55 

His Asp Phe Met Phe Val Lys Arg 
65 70 

Asp Arg lie Tyr Tyr Leu Ala Met 

65 

Leu Phe Tyr Ser Glu lie Pro Lys 

100 

Met Leu Ser Trp Lys Pro Leu Leu 
115 120 

Tyr Gly Met Tyr Ser Arg Glu Glu 
130 135 

lie Gly Thr Val Gly lie Ala Ser 
145 150 

Thr Phe Leu Phe Gin Ala Gly Ser 

165 

Gly Pro Gin Gly Phe Thr Gin Gin 

180 

Thr Ser Cys Pro Asn lie Arg Met 

TOC r\ A 

A V V 


45 

Gly Tyr Met Met Ala Lys Ala Pro 

60 

Asn Asp Pro Asp Gly Pro His Ser 
75 80 

Ser Gly Glu Asn Arg Glu Asn Thr 
90 95 

Thr lie Asn Arg Ala Ala Val Leu 
105 110 

Asp Leu Phe Gin Ala Thr Leu Asp 

125 

Glu Leu Leu Arg Glu Arg Lys Arg 

140 

Tyr Asp Tyr His Gin Gly Ser Gly 
155 160 

Gly lie Tyr His Val Lys Asp Gly 
170 175 

Pro Leu Arg Pro Asn Leu Val Glu 
185 190 

Asp Pro Lys Leu Cys Pro Ala Asp 

MAP* 


Pro Asp Trp lie Ala Phe lie His 
210 215 
lie Val Thr Arg Glu Glu Arg Arg 
225 230 

Ala Asn Met Glu Glu Asp Ala Arg 

245 

Leu Gin Glu Glu Phe Asp Arg Tyr 

260 

Ala Glu Thr Thr Pro Ser Gly Gly 
275 280 

Glu Asn Asp Glu Ser Glu Val Glu 
290 295 

Leu Glu Thr Arg Arg Ala Asp Ser 
305 310 

Ala Asn Pro Lys Val Thr Phe Lys 

325 

Glu Gly Arg lie lie Asp Val lie 

340 

Glu lie Leu Phe Glu Gly Val Glu 
355 360 

Pro Glu Gly Lys Tyr Ala Trp Ser 
370 375 

Arg Leu Gin lie Val Leu lie Ser 
385 390 

Asp Asp Val Met Glu Arg Gin Arg 

405 

Val Thr Pro Leu lie lie Tyr Glu 

420 

lie His Asp lie Phe His Val Phe 


Ser Asn Asp lie Trp He Ser Asn 

220 

Leu Thr Tyr val His Asn Glu Leu 
235 240 

Ser Ala Gly Val Ala Thr Phe Val 
250 255 

Ser Gly Tyr Trp Trp Cys Pro Lys 
265 270 

Lys lie Leu Arg lie Leu Tyr Glu 

285 

lie lie His Val Thr Ser Pro Met 

300 

Phe Arg Tyr Pro Lys Thr Gly Thr 
315 320 

Met Ser Glu lie Met lie Asp Ala 
330 335 

Asp Lys Glu Leu lie Gin Pro Phe 
345 350 

Tyr lie Ala Arg Ala Gly Trp Thr 

365 

lie Leu Leu Asp Arg Ser Gin Thr 

380 

Pro Glu Leu Phe lie Pro Val Glu 
395 400 

Leu lie Glu Ser Val Pro Asp Ser 
410 415 

Glu Thr Thr Asp lie Trp lie Asn 
425 430 

Pro Gin Ser His Glu Glu Glu lie 


435 440 445 

Glu Phe lie Phe Ala Ser Glu Cys Lys Thr Gly Phe Arg His Leu Tyr 
450 455 460 


Lys lie Thr Ser lie Leu Lys Glu Ser Lys Tyr Lys Arg Ser Ser Gly 

465 470 475 480 

Gly Leu Pro Ala Pro Ser Asp Phe Lys Cys Pro lie Lys Glu Glu lie 

485 490 495 

Ala lie Thr Ser Gly Glu Trp Glu Val Leu Gly Arg His Gly Ser Asn 

500 505 510 

lie Gin Val Asp Glu Val Arg Arg Leu Val Tyr Phe Glu Gly Thr Lys 
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Asp 

Ser 

515 

Pro 

Leu 

Glu 

His 

His 

Gly 

530 

Glu 

Val 

Thr 

Arg 

Leu 

535 

Thr 

545 

He 

Ser 

Gin 

His 

Cys 

550 

Asp 

Phe 

Asn 

Pro 

His 

Cys 

565 

Val 

Ser 

Leu 

Pro 

Thr 

Cys 

580 

Lys 

Thr 

Lys 

Glu 

Gly 

Pro 

595 

Leu 

Pro 

Asp 

Tyr 

Thr 

Thr 

610 

Thr 

Gly 

Phe 

Thr 

Leu 

615 

Tyr 

625 

Gin 

Pro 

Gly 

Lys 

Lys 

630 

Tyr 

Pro 

Leu 

Arg 

Cys 

Ser 

660 

645 

Trp 




< 210 > 12 
<211> 4829 

<212> DNA 
<213> Homo sapiens 
<4 00 > 12 

aagtgctaaa gcctccgagg ccaaggccgc 
cgttcgccgc ctgggttgtc accggcgccg 
gagtggaggc ggcgcagcat gaagcggcgc 
tccgggcggg gccgggggga aggaaaatgc 
ctgggtgttg agatatttga aactgcggac 
cctaaattgg agccttttta tgttgagcgg 
gccgatacca gaaaatatca tggctacatg 
gtgaagagga atgatccaga tggacctcat 
ggtgagaaoa gagaaaatac actgttttat 
gcagtcttaa tgctctcttg gaagcctctt 
ggaatgtatt ctcgagaaga agaactatta 
attgcttctt acgattatca ccaaggaagt 
atttatcacg taaaagatgg agggccacaa 
ctagtggaaa ctagttgtcc caacatacgg 
gactggattg cttttataca tagcaacgat 
gaaaggagac tcacttatgt gcacaatgag 
gctggagtcg ctacctttgt tctccaagaa 
tgtccaaaag ctgaaacaac tcccagtggt 
aatgatgaat ctgaggtgga aattattcat 
gcagattcat tccgttatcc taaaacaggt 
tcagaaataa tgattgatgc tgaaggaagg 
caaccttttg agattctatt tgaaggagtt 
gagggaaaat atgcttggtc catcctacta 
ttgatctcac ctgaattatt tatcccagta 
attgagtcag tgcctgattc tgtgacgcca 
tggataaata tccatgacat ctttcatgtt 
tttatttttg cctctgaatg caaaacaggt 
ttaaaggaaa gcaaatataa acgatccagt 
tgtcctatca aagaggagat agcaattacc 
ggatctaata tccaagttga tgaagtcaga 
tcccctttag agcatcacct gtacgtagtc 
ctgactgacc gtggctactc acattcttgc 
agtaagtata gtaaccagaa gaatccacac 
gaagatgacc caacttgcaa aacaaaggaa 


520 





525 




Leu 

Tyr 

Val 

val 

Ser 

540 

Tyr 

Val 

Asn 

Pro 

Asp 

Arg 

Gly Tyr 
555 

Ser 

His 

Ser 

Cys 

Cys 

560 

Phe 

lie 

Ser 

570 

Lys 

Tyr 

Ser 

Asn 

Gin 

575 

Lys 

Tyr 

Lys 

585 

Leu 

Ser 

Ser 

Pro 

Glu Asp 
590 

Asp 

Phe 

600 

Trp 

Ala 

Thr 

lie 

Leu 

605 

Asp 

Ser 

Ala 

Pro 

Pro 

Glu 

lie 

Phe 

620 

Ser 

Phe 

Glu 

Ser 

Gly 

Met 

Leu 

Tyr 

635 

Lys 

Pro 

His 

Asp 

Leu 

640 

Thr 

Val 

Leu 

650 

Phe 

lie 

Tyr 

Gly Gly 
655 

Leu 


tgctactgcc gccgctgctt cttagtgccg 
ccgccgagga agccactgca accaggaccg 
aggcccgctc catagcgcac gtcgggacgg 
aacatggcag cagcaatgga aacagaacag 
tgtgaggaga atattgaatc acaggatcgg 
tattcctgga gtcagcttaa aaagctgctt 
atggctaagg caccacatga tttcatgttt 
tcagacagaa tctattacct tgccatgtct 
tctgaaattc ccaaaactat caatagagca 
ttggatcttt ttcaggcaac actggactat 
agagaaagaa aacgcattgg aacagtcgga 
ggaacatttc tgtttcaagc cggtagtgga 
ggatttacgc aacaaccttt aaggcccaat 
atggatccaa aattatgccc tgctgatcca 
atttggatat ctaacatcgt aaccagagaa 
ctagccaaca tggaagaaga tgccagatca 
gaatttgata gatattctgg ctattggtgg 
ggtaaaattc ttagaattct atatgaagaa 
gttacatccc ctatgttgga aacaaggagg 
acagcaaatc ctaaagtcac ttttaagatg 
atcatagatg tcatagataa ggaactaatt 
gaatatattg ccagagctgg atggactcct 
gatcgctccc agactcgcct acagatagtg 
gaagatgatg ttatggaaag gcagagactc 
ctaattatct atgaagaaac aacagacatc 
tttccccaaa gtcacgaaga ggaaattgag 
ttccgtcatt tatacaaaat tacatctatt 
ggtgggotgo ctgctccaag tgatttcaag 
agtggtgaat gggaagttct tggccggcat 
aggctggtat attttgaagg caccaaagac 
agttacgtaa atcctggaga ggtgacaagg 
tgcatcagtc agcactgtga cttctttata 
tgtgtgtccc tttacaagct atcaagtcct 
ttttgggcca ccattttgga ttcagcaggt 


60 

120 

180 

240 

300 

360 

420 

460 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 
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cctcttcctg 

ttgtatggga 

ctgttcatat 

agtatttccg 

ggggatcctg 

tagaaattga 

acttagatcg 

taatgcagag 

tcttctatga 

gctattactt 

tactgctctt 

tgagtttttt 

acagcataag 

aagaaaacct 

actctctggt 

gaattgatca 

tgccatgcag 

aatcaaatga 

aaaaaaaaaa 

ttataaactt 


catgatgata 

gatagcactt 

tatgaactaa 

taacttagtg 

tctgtatcta 

ccttttgcat 

ataattacag 

tttttttttt 

tagtggtgcc 

ttgaagggaa 

tggtgagggc 

gtttggaaac 

gtcttgccag 

cttttataat 

tccttttaaa 

agataacctc 

tgtttaaagc 

ggatttagct 

gagtcttggg 

ttgatgcttt 

cactccaaga 

tagagagcac 

acaccaagtg 

tgggaaaggt 

tcttattatc 

ttatatatat 

attttccccc 


actatactcc 
tgctctacaa 
atggtggtct 
cttgaatacc 
tcaccgaggg 
cgatcaggtg 
tgtgggcatc 
gtcagatatc 
tacaggatac 
aggatctgtg 
acatggtttc 
agtgagggct 
agttcctgaa 
tggatcacgt 
atacactggc 
tcacattttg 
gggtctacgg 
tacatattcc 
agacattagc 
catgaactta 
ctagtgattg 
aaatgtaatt 
taactattaa 
gcctcctaaa 
gtataaataa 
aaaggctagt 
gcagaaaatt 
cctttaagaa 
agtttccatt 
tcacagtggg 
ctgattaaaa 
tgaaatttaa 
cacagctaca 
tctttttttt 
attatttgaa 
tttaccataa 
ctgtaaacag 
atgaagagag 
gtgctagatt 
ttatcacaca 
aaatttgcta 
agcctccatt 
ggaggtaggg 
attgggtttt 
tgtatatgta 
tttcaatgtt 
ttaagtataa 


tccagaaatt 

gcctcatgat 

cctcaggtgc 

ctagcctctc 

cttaaatttg 

gaaggactcc 

cacggctggt 

ttcagggttg 

acggaacgtt 

gccatgcaag 

ctggatgaga 

ggaaagccat 

tcgggagaac 

attgctgctc 

tatttaacca 

atacctgcca 

tttgtggtag 

tgagagaccc 

accatgtatt 

attagtgtat 

gtttggttca 

gaatagctta 

acttgactta 

gacacttgtt 

ttctcatttt 

aactgtatat 

tttaatcaaa 

aaggaaaaag 

tggtatttcc 

gtggggagac 

actaagcaga 

acaactgtaa 

gagtaacatg 

tttcctctta 

cataatttag 

actaatccct 

tttttctgaa 

ctcataggct 

ggtcacattg 

cttcagtgtt 

tatttctttc 


ttctcttttg 

ctacagcctg 

agttggtgaa 

taggttatgt 

aaggcgcctt 

aatatctagc 

cctatggagg 

ctattgctgg 

atatgggtca 

cagaaaagtt 

atgtccattt 

atgatttaca 

attatgaact 

taaaagtgat 

aatgaggagg 

tgtaacatct 

taatctaata 

agcaatacca 

catactaccc 

ttttacagta 

gttccagaat 

tgcttcattg 

accagtcatt 

ttggcactga 

tctttgatga 

gtagcatgga 

tgattagagc 

tacacattca 

ttattaaaat 

ctgggtttgg 

agtagtttta 

tattaaggaa 

attcagggga 

ggtttagaaa 

atattgtaag 

ggcaagccat 

tgatcatgaa 

ttcaggtgct 

acaccagtgg 

tagaaagtta 

tcatcactac 


aaagtactac 

gaaagaaata 

taatcggttt 

ggttgtagtg 

taaatataaa 

ttctcgatat 

atacctctcc 

ggccccagtc 

ccctgaccag 

cccctctgaa 

tgcacatacc 

gatctatcct 

gcatcttttg 

ataattttga 

tttaatcaac 

actcctgaaa 

ccttaacccc 

taagaattac 

tattttcact 

tacttttgag 

ctttgactag 

cttgggcata 

cattaataat 

ccagttttta 

tattaacaga 

tttaattagt 

ttaaatattt 

ctagaattct 

attctagaat 

ggaatgacag 

acaaaaatac 

accagaatca 

ggaaaagttc 

tcttacaaat 

cttaaaatac 


agtaaggttg 

aaagcccaga 

gctgtttgta 

ttgcttttca 

aaattaaatt 


gtgactattg 

aatggcagga 

tttatactgt 

tgtttgatat 

gatttgggta 


ggctctcttt 

cttttcttgg 

aattgagatc 

cagggaaggc 

ttaccaatac 

agagagagta 

gtaagaggtg 

tgatatggtg 

ataatagata 

tttcccatgc 

actttcttcc 


atctgactg 


tggatttaca 2100 
tcctactgtg 2160 
aaaggagtca 2220 
atagacaaca 2280 
atgggtcaaa 2340 
gatttcattg 2400 
ctgatggcat 2460 
actctgtgga 2520 
aatgaacagg 2580 
ccaaatcgtt 2640 
agtatattac 2700 
caggagagac 2760 
cactaccttc 2820 
cctgtgtaga 2880 
agaaaacaca 2940 
ataaatgtgg 3000 
acatgctcaa 3060 
taaaaaaaaa 3120 
tttaatagta 3180 
tttgttaaaa 3240 
ttacagattt 3300 
tccagcatgt 3360 
ttttcaagga 3420 
gccaatttaa 3480 
gtgggctttt 3540 
catgatattg 3600 
gcaggcaagt 3660 
tcagaaaatt 3720 
tttaaggaga 3780 
agagaagagg 3840 
tcatgaaaat 3900 
ataaatcact 3960 
cttagagtta 4020 
ttaaacttta 4080 
aaatgtttat 4140 
ttttttttgg 4200 
tttagcacta 4260 
tgccctgtta 4320 
atctatgagt 4380 
ttttaaacaa 4440 
gatttcccca 4500 
gacttcattg 4560 
gttctgtcgt 4620 
ccacgctttt 4680 
caagatttgt 4740 
ccaagaaagt 4800 

4829 


<210> 13 

<211> 358 

<212> PRT 

<213 > Homo sapiens 

<400> 13 


Met Ala Ala Ala Met Glu Thr Glu Gin Leu Gly Val Glu lie Phe Glu 
15 10 15 

Thr Ala Asp Cys Glu Glu Asn lie Glu Ser Gin Asp Arg Pro Lys Leu 

20 25 30 

Glu Pro Phe Tyr Val Glu Arg Tyr Ser Trp Ser Gin Leu Lys Lys Leu 
35 40 45 
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Leu 

Ala 

Asp 

Thr 

Arg 

Lys 

Tyr 

His 

Gly 

Tyr 

Met 

Met 

Ala 

Lys 

Ala 

Pro 


50 





55 





60 





His 

Asp 

Phe 

Met 

Phe 

Val 

Lys 

Arg 

Asn 

Asp 

Pro 

Asp 

Gly 

Pro 

His 

Ser 

65 





70 





75 





80 

Asp 

Arg 

lie 

Tyr 

Tyr 

Leu 

Ala 

Met 

Ser 

Gly 

Glu 

Asn 

Arg 

Glu 

Asn 

Thr 





85 





90 





95 


Leu 

Phe 

Tyr 

Ser 

Glu 

lie 

Pro 

Lys 

Thr 

lie 

Asn 

Arg 

Ala 

Ala 

Val 

Leu 




100 





105 





110 



Met 

Leu 

Ser 

Trp 

Lys 

Pro 

Leu 

Leu 

Asp 

Leu 

Phe 

Gin 

Ala 

Thr 

Leu 

Asp 



115 





120 





125 




Tyr 

Gly 

Met 

Tyr 

Ser 

Arg 

Glu 

Glu 

Glu 

Leu 

Leu 

Arg 

Glu 

Arg 

Lys 

Arg 


130 





135 





140 





He 

Gly 

Thr 

Val 

Gly 

lie 

Ala 

Ser 

Tyr 

Asp 

Tyr 

His 

Gin 

Gly 

Ser 

Gly 

145 





150 





155 





160 

Thr 

Phe 

Leu 

Phe 

Gin 

Ala 

Gly 

Ser 

Gly 

lie 

Tyr 

His 

Val 

Lys 

Asp 

Gly 





165 





170 





175 


Gly 

Pro 

Gin 

Gly 

Phe 

Thr 

Gin 

Gin 

Pro 

Leu 

Arg 

Pro 

Asn 

Leu 

Val 

Glu 




180 





185 





190 



Thr 

Ser 

Cys 

Pro 

Asn 

lie 

Arg 

Met 

Asp 

Pro 

Lys 

Leu 

Cys 

Pro 

Ala 

Asp 



195 





200 





205 




PlTG 

Asp 

- - - 

J.j.y 

x 

rLxa 

nV . 
rue 

Tl A 
X XC 

TtJ M 

axo 

n _ 

iJCA 

* AM 

AOiJl 

Asp 

Tl - 
X XG 

xx±/ 

T 1 * 

nx. 

OCX 

Asn 


210 





215 





220 





lie 

Val 

Thr 

Arg 

Glu 

Glu 

Arg 

Arg 

Leu 

Thr 

Tyr 

Val 

His 

Asn 

Glu 

Leu 

225 





230 





235 





240 

Ala 

Asn 

Met 

Glu 

Glu 

Asp 

Ala 

Arg 

Ser 

Ala 

Gly 

Val 

Ala 

Thr 

Phe 

Val 





245 





250 





255 


Leu 

Gin 

Glu 

Glu 

Phe 

Asp 

Arg 

Tyr 

Ser 

Gly 

Tyr 

Trp 

Trp 

Cys 

Pro 

Lys 




260 





265 





270 



Ala 

Glu 

Thr 

Thr 

Pro 

Ser 

Gly 

Gly 

Lys 

He 

Leu 

Arg 

lie 

Leu 

Tyr 

Glu 



275 





280 





285 




Glu 

Asn 

Asp 

Glu 

Ser 

Glu 

Val 

Glu 

lie 

lie 

His 

Val 

Thr 

Ser 

Pro 

Met 


290 





295 





300 





Leu 

Glu 

Thr 

Arg 

Arg 

Ala 

Asp 

Ser 

Phe 

Arg 

Tyr 

Pro 

Lys 

Thr 

Gly 

Thr 

305 





310 





315 





320 

Ala 

Asn 

Pro 

Lys 

val 

Thr 

Phe 

Lys 

Met 

Ser 

Glu 

lie 

Met 

lie 

Asp 

Ala 





325 





330 





335 


GlU 

Gly 

Arg 

Ser 

Lys 

Leu 

Met 

Lys 

Ser 

Glu 

Gly 

Trp 

Tyr 

lie 

Leu 

Lys 




340 





345 





350 



Ala 

Pro 

Lys 

Thr 

Pro 

Leu 












355 


<210> 14 

<211> 4309 

<212> DNA 

<213> Homo sapiens 

<400> 14 

aagtgctaaa gcctccgagg ccaaggccgc tgctactgcc gccgctgctt cttagtgccg 60 

cgttcgccgc ctgggttgtc accggcgccg ccgccgagga agccactgca accaggaccg 120 

gagtggaggc ggcgcagcat gaagcggcgc aggcccgctc catagcgcac gtcgggacgg 180 

tccgggcggg gccgggggga aggaaaatgc aacatggcag cagcaatgga aacagaacag 240 

ctgggtgttg agatatttga aactgcggac tgtgaggaga atattgaatc acaggatcgg 300 

cctaaattgg agccttttta tgttgagcgg tattcctgga gtcagcttaa aaagctgctt 360 

gccgatacca gaaaatatca tggctacatg atggctaagg caccacatga tttcatgttt 420 

gtgaagagga atgatccaga tggacctcat tcagacagaa tctattacct tgccatgtct 480 

ggtgagaaca gagaaaatac actgttttat tctgaaattc ccaaaactat caatagagca 540 

gcagtcttaa tgctctcttg gaagcctctt ttggatcttt ttcaggcaac actggactat 600 

ggaatgtatt ctcgagaaga agaactatta agagaaagaa aacgcattgg aacagtcgga 660 

attgcttctt acgattatca ccaaggaagt ggaacatttc tgtttcaagc cggtagtgga 720 

atttatcacg taaaagatgg agggccacaa ggatttacgc aacaaccttt aaggcccaat 780 
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ctagtggaaa ctagttgtcc caacatacgg 
gactggattg cttttataca tagcaacgat 
gaaaggagac tcacttatgt gcacaatgag 
gctggagtcg ctacctttgt tctccaagaa 
tgtccaaaag ctgaaacaac tcccagtggt 
aatgatgaat ctgaggtgga aattattcat 
gcagattcat tccgttatcc taaaacaggt 
tcagaaataa tgattgatgc tgaaggaaga 
attttgaagg caccaaagac tcccctttag 
atcctggaga ggtgacaagg ctgactgacc 
agcactgtga cttctttata agtaagtata 
tttacaagct atcaagtcct gaagatgacc 
ccattttgga ttcagcaggt cctcttcctg 
aaagtactac tggatttaca ttgtatggga 
gaaagaaata tcctactgtg ctgttcatat 
taatcggttt aaaggagtca agtatttccg 
ggttgtagtg atagacaaca ggggatcctg 
taaatataaa atgggtcaaa tagaaattga 
ttctcgatat gatttcattg acttagatcg 
atacctctcc ctgatggcat taatgcagag 

<•. *- _ a. • *. __ 

ccctgaccag aatgaacagg gctattactt 
cccctctgaa ccaaatcgtt tactgctctt 
tgcacatacc agtatattac tgagtttttt 
gatctatcct caggagagac acagcataag 
gcatcttttg cactaccttc aagaaaacct 
ataattttga cctgtgtaga actctctggt 
tttaatcaac agaaaacaca gaattgatca 
actcctgaaa ataaatgtgg tgccatgcag 
ccttaacccc acatgctcaa aatcaaatga 
taagaattac taaaaaaaaa aaaaaaaaaa 
tattttcact tttaatagta ttataaactt 
tacttttgag tttgttaaaa tatgatgata 
ctttgactag ttacagattt gatagcactt 
cttgggcata tccagcatgt tatgaactaa 
cattaataat ttttcaagga taacttagtg 
ccagttttta gccaatttaa tctgtatcta 
tattaacaga gtgggctttt ccttttgcat 
tttaattagt catgatattg ataattacag 
ttaaatattt gcaggcaagt tttttttttt 
ctagaattct tcagaaaatt tagtggtgcc 
attctagaat tttaaggaga ttgaagggaa 
ggaatgacag agagaagagg tggtgagggc 
acaaaaatac tcatgaaaat gtttggaaac 
accagaatca ataaatcact gtcttgccag 
ggaaaagttc cttagagtta cttttataat 
tcttacaaat ttaaacttta tccttttaaa 
cttaaaatac aaatgtttat agataacctc 
ggctctcttt ttttttttgg tgtttaaagc 
cttttcttgg tttagcacta ggatttagct 
aattgagatc tgccctgtta gagtcttggg 
cagggaaggc atctatgagt ttgatgcttt 
ttaccaatac ttttaaacaa cactccaaga 
agagagagta gatttcccca tagagagcac 
gtaagaggtg gacttcattg acaccaagtg 
tgatatggtg gttctgtcgt tgggaaaggt 
ataatagata ccacgctttt tcttattatc 
tttcccatgc caagatttgt ttatatatat 
actttcttcc ccaagaaagt attttccccc 


atggatccaa aattatgccc tgctgatcca 
atttggatat ctaacatcgt aaccagagaa 
ctagccaaca tggaagaaga tgccagatca 
gaatttgata gatattctgg ctattggtgg 
ggtaaaattc ttagaattct atatgaagaa 
gttacatccc ctatgttgga aacaaggagg 
acagcaaatc ctaaagtcac ttttaagatg 
tccaagttga tgaagtcaga aggctggtat 
agcatcacct gtacgtagtc agttacgtaa 
gtggctactc acattcttgc tgcatcagtc 
gtaaccagaa gaatccacac tgtgtgtccc 
caacttgcaa aacaaaggaa ttttgggcca 
actatactcc tccagaaatt ttctcttttg 
tgctctacaa gcctcatgat ctacagcctg 
atggtggtct cctcaggtgc agttggtgaa 
cttgaatacc ctagcctctc taggttatgt 
tcaccgaggg cttaaatttg aaggcgcctt 
cgatcaggtg gaaggactcc aatatctagc 
tgtgggcatc cacggctggt cctatggagg 
gtcagatatc ttcagggttg ctattgctgg 
tacaggatac acggaacgtt atatgggtca 
aggatctgtg gccatgcaag cagaaaagtt 
acatggtttc ctggatgaga atgtccattt 
agtgagggct ggaaagccat atgatttaca 
agttcctgaa tcgggagaac attatgaact 
tggatcacgt attgctgctc taaaagtgat 
atacactggc tatttaacca aatgaggagg 
tcacattttg atacctgcca tgtaacatct 
gggtctacgg tttgtggtag taatctaata 
tacatattcc tgagagaccc agcaatacca 
agacattagc accatgtatt catactaccc 
catgaactta attagtgtat ttttacagta 
ttagtgattg gtttggttca gttccagaat 
aaatgtaatt gaatagctta tgcttcattg 
taactattaa acttgactta accagtcatt 
gcctcctaaa gacacttgtt ttggcactga 
gtataaataa ttctcatttt tctttgatga 
aaaggctagt aactgtatat gtagcatgga 
gcagaaaatt tttaatcaaa tgattagagc 
cctttaagaa aaggaaaaag tacacattca 
agtttccatt tggtatttcc ttattaaaat 
tcacagtggg gtggggagac ctgggtttgg 
ctgattaaaa actaagcaga agtagtttta 
tgaaatttaa acaactgtaa tattaaggaa 
cacagctaca gagtaacatg attcagggga 
tctttttttt tttcctctta ggtttagaaa 
attatttgaa cataatttag atattgtaag 
tttaccataa actaatccct ggcaagccat 
ctgtaaacag tttttctgaa tgatcatgaa 
atgaagagag ctcataggct ttcaggtgct 
gtgctagatt ggtcacattg acaccagtgg 
ttatcacaca cttcagtgtt tagaaagtta 
aaatttgcta tatttctttc tcatcactac 
agcctccatt agtaaggttg gtgactattg 
ggaggtaggg aaagcccaga aatggcagga 
attgggtttt gctgtttgta tttatactgt 
tgtatatgta ttgcttttca tgtttgatat 
tttcaatgtt aaattaaatt gatttgggta 
ttaagtataa atctgactg 


840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4309 
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<210> 15 

<211> 108 
<212 > PRT 
<213> Homo sapiens 
<400> 15 


Met 

Ala 

Ala 

Ala 

Met 

GlU 

Thr 

Glu 

Gin 

Leu 

Gly Val 

Glu 

lie 

Phe 

Glu 

1 




5 





10 





15 


Thr 

Ala 

Asp 

Cys 

Glu 

Glu 

Asn 

lie 

Glu 

Ser 

Gin Asp 

Arg 

Pro 

Lys 

Leu 




20 





25 





30 



Glu 

Pro 

Phe 

Tyr 

Val 

Glu 

Arg 

Tyr 

Ser 

Trp 

Ser 

Gin 

Leu 

Lys 

Lys 

Leu 



35 





40 





45 




Leu 

Ala 

Asp 

Thr 

Arg 

Lys 

Tyr 

His 

Gly 

Tyr 

Met 

Met 

Ala 

Lys 

Ala 

Pro 


50 





55 





60 





His 

Asp 

Phe 

Met 

Phe 

Val 

Lys 

Arg 

Asn 

Asp 

Pro 

Asp 

Gly 

Pro 

His 

Ser 

65 





70 





75 





80 

Asp 

Arg 

He 

Tyr 

Tyr 

Leu 

Gly 

Asn 

Lys 

Ser 

Leu 

lie 

Asp 

His 

Asp 

Arg 





85 





90 





95 


Phe 

Ser 

Lys 

Ser 

Lys 

Met 

Pro 

Glu 

lie 

Ala 

Ser 

Ser 








100 





105 









<210> 16 

<211> 620 

<212> DNA 

<213> Homo sapiens 

<400> 16 


aagtgctaaa 

cgttcgccgc 

gagtggaggc 

tccgggcggg 

ctgggtgttg 

cctaaattgg 

gccgatacca 

gtgaagagga 

tcattaattg 

agctagcttg 

tagtttgttc 


gcctccgagg 

ctgggttgtc 

ggcgcagcat 

gccgggggga 

agatatttga 

agccttttta 

gaaaatatca 

atgatccaga 

atcatgatcg 

aaatgccttt 

ccactcaaca 


ccaaggccgc 

accggcgccg 

gaagcggcgc 

aggaaaatgc 

aactgcggac 

tgttgagcgg 

tggctacatg 

tggacctcat 

tttttcaaaa 

ctttagatgg 


tgctactgcc 

ccgccgagga 

aggcccgctc 

aacatggcag 

tgtgaggaga 

tattcctgga 

atggctaagg 

tcagacagaa 

tcgaagatgc 

tctgattagg 


gccgctgctt 

agccactgca 

catagcgcac 

cagcaatgga 

atattgaatc 

gtcagcttaa 

caccacatga 

tctattacct 

cagaaattgc 

aaaacaaaca 


cttagtgccg 

accaggaccg 

gtcgggacgg 

aacagaacag 

acaggatcgg 

aaagctgctt 

tttcatgttt 

tggtaacaag 

ttcttcctaa 

ataaaaccat 


60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

620 


<210> 

17 

<211> 

194 

<212> 

PRT 

<213> 

Homo sapiens 

<40Q> 

17 
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Met Ala Ala Ala Met Glu Thr Glu Gin Leu Gly Val Glu lie Phe Glu 
IS 10 15 

Thr Ala Asp Cys Glu Glu Asn lie Glu Ser Gin Asp Arg Pro Lys Leu 

20 25 30 
Glu Pro Phe Tyr Val Glu Arg Tyr Ser Trp Ser Gin Leu Lys Lys Leu 
35 40 45 

Leu Ala Asp Thr Arg Lys Tyr His Gly Tyr Met Met Ala Lys Ala Pro 
50 55 60 

His Asp Phe Met Phe Val Lys Arg Asn Asp Pro Asp Gly Pro His Ser 

65 70 75 80 

Asp Arg He Tyr Tyr Leu Ala Met Ser Gly Glu Asn Arg Glu Asn Thr 

85 90 95 

Leu Phe Tyr Ser Glu lie Pro Lys Thr lie Asn Arg Ala Ala Val Leu 

100 105 110 

Met Leu Ser Trp Lys Pro Leu Leu Asp Leu Phe Gin Ala Thr Leu Asp 
115 120 125 

Tyr Gly Met Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arq Lys Arq 
130 135 140 

He Gly Thr Val Gly lie Ala Ser Tyr Asp Tyr His Gin Gly Ser Gly 

145 150 155 160 

Thr Phe Leu Phe Gin Ala Gly ser Gly lie Tyr His Val Lys Asd Gly 

165 170 175 

Gly Pro Gin Gly Phe Thr Gin Pro Leu Arg Pro Asn Leu Val Glu Thr 

180 185 190 

Cys Ala 

< 210 > 18 

<211> 832 

<212> DNA 

<213> Homo sapiens 

<400> 18 

aagtgctaaa gcctccgagg ccaaggccgc tgctactgcc gccgctgctt cttagtgccg 60 

cgttcgccgc ctgggttgtc accggcgccg ccgccgagga agccactgca accaggaccg 120 

gagtggaggc ggcgcagcat gaagcggcgc aggcccgctc catagcgcac gtcgggacgg 180 

tccgggcggg gccgggggga aggaaaatgc aacatggcag cagcaatgga aacagaacag 240 

ctgggtgttg agatatttga aactgcggac tgtgaggaga atattgaatc acaggatcgg 300 

cctaaattgg agccttttta tgttgagcgg tattcctgga gtcagcttaa aaagctgctt 360 

gccgatacca gaaaatatca tggctacatg atggctaagg caccacatga tttcatgttt 420 

gtgaagagga atgatccaga tggacctcat tcagacagaa tctattacct tgccatgtct 480 

ggtgagaaca gagaaaatac actgttttat tctgaaattc ccaaaactat caatagagca 540 

gcagtcttaa tgctctcttg gaagcctctt ttggatcttt ttcaggcaac actggactat 600 

ggaatgtatt ctcgagaaga agaactatta agagaaagaa aacgcattgg aacagtcgga 660 

attgcttctt acgattatca ccaaggaagt ggaacatttc tgtttcaagc cggtagtgga 720 

atttatcacg taaaagatgg agggccacaa ggatttacgc wacaaccttt aaggcccaat 780 

ctagtggaaa ctasttgtsc caracytgca tgacccaatc agatcctgta ga 832 

<210> 19 

<211> 658 

<212> PRT 

<213> Homo sapiens 

<400> 19 

Met Ala Ala Ala Met Glu Thr Glu Gin Leu Gly Val Glu lie Phe Glu 

1 5 10 .15 

Thr Ala Asp Cys Glu Glu Asn lie Glu Ser Gin Asp Arg Pro Lys Leu 

20 25 30 

Glu Pro Phe Tyr Val Glu Arg Tyr Ser Trp Ser Gin Leu Lys Lys Leu 
35 40 45 

Leu Ala Asp Thr Arg Lys Tyr His Gly Tyr Met Met Ala Lys Ala Pro 
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His 

50 

ASp 

Phe 

Met 

Phe 

Val 

55 

Lys 

Arg 

65 

Asp 

Arg 

lie 

Tyr 

Tyr 

70 

Leu 

Ala 

Met 

Leu 

Phe 

Tyr 

Ser 

85 

Glu 

lie 

Pro 

Lys 

Met 

Leu 

Ser 

100 

Trp 

Lys 

Pro 

Leu 

Leu 

Tyr 

115 
Gly Met 

Tyr 

Ser 

Arg 

Glu 

120 

Glu 

He 

130 

Gly Thr 

Val 

Gly 

lie 

135 

Ala 

Ser 

145 

Thr 

Phe 

Leu 

Phe 

Gin 

150 

Ala 

Gly 

Ser 

Gly 

Pro 

Gin 

Gly 

165 

Phe 

Thr 

Gin 

Gin 

Thr 

Ser 

Cys 

180 

Pro 

Asn 

lie 

Arg 

Met 

Pro 

Asp 

195 

Trp 

lie 

Ala 

Phe 

lie 

200 

His 

lie 

210 

Val 

Thr 

Arg 

Glu 

Glu 

Arg 

Arg 

225 

Ala 

Asn 

Met 

Glu 

Glu 

230 

Asp 

Ala 

Arg 

Leu 

Gin 

Glu 

Glu 

245 

Phe 

Asp 

Arg 

Tyr 

Ala 

Glu 

Thr 

260 

Thr 

Pro 

Ser 

Gly 

Gly 

Glu 

275 
Asn Asp 

Glu 

Ser 

Glu 

Val 

280 

GlU 

Leu 

290 

Glu 

Thr 

Arg 

Arg 

Ala 

295 

Asp 

Ser 

305 

Ala 

Asn 

Pro 

Lys 

Val 

310 

Thr 

Phe 

Lys 

Glu Gly Arg 

lie 

325 

lie 

Asp 

Val 

He 

Glu 

lie 

Leu 

340 

Phe 

Glu 

Gly 

Val 

Glu 

Pro 

Glu 

355 

Gly 

Lys 

Tyr 

Ala 

Trp 

360 

Ser 

Arg 

370 

Leu 

Gin 

lie 

Val 

Leu 

375 

lie 

Ser 

385 

Asp Asp 

Val 

Met 

Glu 

390 

Arg 

Gin 

Arg 

Val 

Thr 

Pro 

Leu 

405 

lie 

lie 

Tyr 

Glu 

lie 

His 

Asp 

420 

lie 

Phe 

His 

Val 

Phe 

Glu 

Phe 

435 

lie 

Phe 

Ala 

Ser 

Glu 

440 

Cys 

Lys 

450 

lie 

Thr 

Ser 

lie 

Leu 

455 

Lys 

Glu 

465 

Gly 

Leu 

Pro 

Ala 

Pro 

470 

Ser 

Asp 

Phe 

Ala 

lie 

Thr 

Ser 

485 

Gly 

Glu 

Trp 

Glu 

lie 

Gin 

Val 

500 

Asp 

Glu 

Val 

Arg 

Arg 

Asp 

Ser 

515 

Pro 

Leu 

Glu 

His 

His 

520 

Leu 


60 


Asn Asp 

Pro Asp 
75 

Gly Pro 

His 

Ser 

60 

Ser Gly 
90 

Glu Asn 

Arg Glu 

Asn 

95 

Thr 

Thr 

105 

lie 

Asn Arg Ala Ala 

110 

val 

Leu 

Asp 

Leu 

Phe Gin 

Ala Thr 
125 

Leu 

Asp 

Glu 

Leu 

Leu Arg 
140 

Glu Arg 

Lys 

Arg 

Tyr Asp 

Tyr His 
155 

Gin Gly 

Ser 

Gly 

160 

Gly 

lie 

170 

Tyr His 

Val Lys 

Asp 

175 

Gly 

Pro 

185 

Leu 

Arg Pro 

Asn Leu 
190 

Val 

Glu 

Asp 

Pro 

Lys Leu 

Cys Pro 
205 

Ala 

Asp 

Ser 

Asn 

Asp lie 

Trp lie 

Ser 

Asn 

Leu 

Thr 

Tyr Val 
235 

His Asn 

Glu 

Leu 

240 

Ser 

Ala 

250 

Gly Val 

Ala Thr 

Phe 

255 

Val 

Ser 

265 

Gly 

Tyr Trp 

Trp Cys 
270 

Pro 

Lys 

Lys 

lie 

Leu Arg 

lie Leu 
285 

Tyr 

Glu 

lie 

lie 

His Val 
300 

Thr Ser 

Pro 

Met 

Phe 

Arg 

Tyr Pro 
315 

Lys Thr Gly 

Thr 

320 

Met 

Ser 

330 

Glu lie 

Met lie 

Asp 

335 

Ala 

Asp 

345 

Lys 

Glu Leu 

He Gin 
350 

Pro 

Phe 

Tyr 

lie 

Ala Arg Ala Gly Trp 
365 

Thr 

He 

Leu 

Leu Asp 
380 

Arg Ser 

Gin 

Thr 

Pro 

Glu 

Leu Phe 
395 

lie Pro 

Val 

Glu 

400 

Leu 

lie 

410 

Glu Ser 

Val Pro 

Asp 

415 

Ser 

Glu 

425 

Thr 

Thr Asp 

lie Trp 
430 

lie 

Asn 

Pro 

Gin 

Ser His 

Glu Glu 

445 

Glu 

lie 

Lys 

Thr 

Gly Phe Arg His 
460 

Leu 

Tyr 

Ser 

Lys 

Tyr Lys 
475 

Arg Ser 

Ser 

Gly 

480 

Lys 

Cys 

490 

Pro lie 

Lys Glu 

Glu 

495 

lie 

Val 

505 

Leu 

Gly Arg 

His Gly 
510 

Ser 

Asn 

Leu 

Val 

Tyr Phe 

Glu Gly Thr 
525 

Lys 

Tyr 

Val 

Val Ser 

Tyr Val 

Asn 

Pro 
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Gly Glu Val Thr Arg Leu Thr Asp Arg Gly Tyr Ser His Ser Cys Cys 

54S S50 555 560 

He Ser Gin His Cys Asp Phe Phe lie Ser Lys Tyr Ser Asn Gin Lys 

565 570 575 

Asn Pro His Cys Val Ser Leu Tyr Lys Leu Ser Ser Pro Glu Asp Asp 

580 585 590 

Pro Thr Cys Lys Thr Lys Glu Phe Trp Ala Thr lie Leu Asp Ser Ala 
595 600 605 

Gly Pro Leu Pro Asp Tyr Thr Pro Pro Glu lie Phe Ser Phe Glu Ser 
610 615 620 

Thr Thr Gly Phe Thr Leu Tyr Gly Met Leu Tyr Lys Pro His Asp Leu 

625 630 635 640 

Gin Pro Gly Lys Lys Tyr Pro Thr Val Leu Phe lie Tyr Gly Gly Arg 

645 650 655 

Val Lys 

< 210 > 20 
<211> 4676 

<212> DNA 

<2X3 ^ KCftiG aaplcHS 
<400> 20 

aagtgctaaa gcctccgagg ccaaggccgc tgctactgcc gccgctgctt cttagtgccg 60 

cgttcgccgc ctgggttgtc accggcgccg ccgccgagga agccactgca accaggaccg 120 

gagtggaggc ggcgcagcat gaagcggcgc aggcccgctc catagcgcac gtcgggacgg 180 

tccgggcggg gccgggggga aggaaaatgc aacatggcag cagcaatgga aacagaacag 240 

ctgggtgttg agatatttga aactgcggac tgtgaggaga atattgaatc acaggatcgg 300 

cctaaattgg agccttttta tgttgagcgg tattcctgga gtcagcttaa aaagctgctt 360 

gccgatacca gaaaatatca tggctacatg atggctaagg caccacatga tttcatgttt 420 

gtgaagagga atgatccaga tggacctcat tcagacagaa tctattacct tgccatgtct 480 

ggtgagaaca gagaaaatac actgttttat tctgaaattc ccaaaactat caatagagca 540 

gcagtcttaa tgctctcttg gaagcctctt ttggatcttt ttcaggcaac actggactat 600 

ggaatgtatt ctcgagaaga agaactatta agagaaagaa aacgcattgg aacagtcgga 660 

attgcttctt acgattatca ccaaggaagt ggaacatttc tgtttcaagc cggtagtgga 720 

atttatcacg taaaagatgg agggccacaa ggatttacgc aacaaccttt aaggcccaat 780 

ctagtggaaa ctagttgtcc caacatacgg atggatccaa aattatgccc tgctgatcca 840 

gactggattg cttttataca tagcaacgat atttggatat ctaacatcgt aaccagagaa 900 

gaaaggagac tcacttatgt gcacaatgag ctagccaaca tggaagaaga tgccagatca 960 

gctggagtcg ctacctttgt tctccaagaa gaatttgata gatattctgg ctattggtgg 1020 

tgtccaaaag ctgaaacaac tcccagtggt ggtaaaattc ttagaattct atatgaagaa 1080 

aatgatgaat ctgaggtgga aattattcat gttacatccc ctatgttgga aacaaggagg 1140 

gcagattcat tccgttatcc taaaacaggt acagcaaatc ctaaagtcac ttttaagatg 1200 

tcagaaataa tgattgatgc tgaaggaagg atcatagatg tcatagataa ggaactaatt 1260 

caaccttttg agattctatt tgaaggagtt gaatatattg ccagagctgg atggactcct 1320 

gagggaaaat atgcttggtc catcctacta gatcgctccc agactcgcct acagatagtg 1380 

ttgatctcac ctgaattatt tatcccagta gaagatgatg ttatggaaag gcagagactc 1440 

attgagtcag tgcctgattc tgtgacgcca ctaattatct atgaagaaac aacagacatc 1500 

tggataaata tccatgacat ctttcatgtt tttccccaaa gtcacgaaga ggaaattgag 1560 

tttatttttg cctctgaatg caaaacaggt ttccgtcatt tatacaaaat tacatctatt 1620 

ttaaaggaaa gcaaatataa acgatccagt ggtgggctgc ctgctccaag tgatttcaag 1680 

tgtcctatca aagaggagat agcaattacc agtggtgaat gggaagttct tggccggcat 1740 

ggatctaata tccaagttga tgaagtcaga aggctggtat attttgaagg caccaaagac 1800 

tcccctttag agcatcacct gtacgtagtc agttacgtaa atcctggaga ggtgacaagg 1860 

ctgactgacc gtggctactc acattcttgc tgcatcagtc agcactgtga cttctttata 1920 

agtaagtata gtaaccagaa gaatccacac tgtgtgtccc tttacaagct atcaagtcct 1980 

gaagatgacc caacttgcaa aacaaaggaa ttttgggcca ccattttgga ttcagcaggt 2040 

cctcttcctg actatactcc tccagaaatt ttctcttttg aaagtactac tggatttaca 2100 

ttgtatggga tgctctacaa gcctcatgat ctacagcctg gaaagaaata tcctactgtg 2160 

ctgttcatat atggtggtcg ggtcaaatag aaattgacga tcaggtggaa ggactccaat 2220 


a 7 


24 
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atctagcttc tcgatatgat ttcattgact tagatcgtgt gggcatccac ggctggtcct 2280 

atggaggata cctctccctg atggcattaa tgcagaggtc agatatcttc agggttgcta 2340 

ttgctggggc cccagtcact ctgtggatct tctatgatac aggatacacg gaacgttata 2400 

tgggtcaecc: tgaccagaat gaacagggct attacttagg atctgtggcc atgcaagcag 2460 

aaaagttccc ctctgaacca aatcgtttac tgctcttaca tggtttcctg gatgagaatg 2520 

tccattttgc acataccagt atattactga gttttttagt gagggctgga aagccatatg 2580 

atttacagat ctatcctcag gagagacaca gcataagagt tcctgaatcg ggagaacatt 2640 

atgaactgca tcttttgcac taccttcaag aaaaccttgg atcacgtatt gctgctctaa 2700 

aagtgatata attttgacct gtgtagaact ctctggtata cactggctat ttaaccaaat 2760 

gaggaggttt aatcaacaga aaacacagaa ttgatcatca cattttgata cctgccatgt 2820 

aacatctact cctgaaaata aatgtggtgc catgcagggg tctacggttt gtggtagtaa 2880 

tctaatacct taaccccaca tgctcaaaat caaatgatac atattcctga gagacccagc 2940 

aataccataa gaattactaa aaaaaaaaaa aaaaaaaaga cattagcacc atgtattcat 3000 

actaccctat tttcactttt aatagtatta taaacttcat gaacttaatt agtgtatttt 3060 

tacagtatac ttttgagttt gttaaaatat gatgatatta gtgattggtt tggttcagtt 3120 

ccagaatctt tgactagtta cagatttgat agcacttaaa tgtaattgaa tagcttatgc 3180 

ttcattgctt gggcatatcc agcatgttat gaactaataa ctattaaact tgacttaacc 3240 

agtcattcat taataatttt tcaaggataa cttagtggcc tcctaaagac acttgttttg 3300 

gcactgacca gtttttagcc aatttaatct gtatctagta taaataattc tcatttttct 3360 

ttgatgatat taacagagtg ggcttttcct tttgcataaa ggctagtaac tgtatatgta 3420 

gcatggattt aattagtcat gatattgata attacaggca gaaaattttc aatcaaatga 3480 

ttagagctta aatatttgca ggcaagtttt tttttttcct ttaagaaaag gaaaaagtac 3540 

acattcacta gaattcttca gaaaatttag tggtgccagt ttccatttgg tatttcctta 3600 

ttaaaatatt ctagaatttt aaggagattg aagggaatca cagtggggtg gggagacctg 3660 

ggtttgggga atgacagaga gaagaggtgg tgagggcctg attaaaaact aagcagaagt 3720 

agttttaaca aaaatactca tgaaaatgtt tggaaactga aatttaaaca actgtaatat 3780 

taaggaaacc agaatcaata aatcactgtc ttgccagcac agctacagag taacatgatt 3840 

caggggagga aaagttcctt agagttactt ttataattct tttttttttt cctcttaggt 3900 

ttagaaatct tacaaattta aactttatcc ttttaaaatt atttgaacat aatttagata 3960 

ttgtaagctt aaaatacaaa tgtttataga taacctcttt accataaact aatccctggc 4020 

aagccatggc tctctttttt tttttggtgt ttaaagcctg taaacagttt ttctgaatga 4080 

tcatgaactt ttcttggttt agcactagga tttagctatg aagagagctc ataggctttc 4140 

aggtgctaat tgagatctgc cctgttagag tcttggggtg ctagattggt cacattgaca 4200 

ccagtggcag ggaaggcatc tatgagtttg atgcttttta tcacacactt cagtgtttag 4260 

aaagttatta ccaatacttt taaacaacac tccaagaaaa tttgctatat ttctttctca 4320 

tcactacaga gagagtagat ttccccatag agagcacagc ctccattagt aaggttggtg 4380 

actattggta agaggtggac ttcattgaca ccaagtggga ggtagggaaa gcccagaaat 4440 

ggcaggatga tatggtggtt ctgtcgttgg gaaaggtatt gggttttgct gtttgtattt 4500 

atactgtata atagatacca cgctttttct tattatctgt atatgtattg cttttcatgt 4560 

ttgatatttt cccatgccaa gatttgttta tatatatttt caatgttaaa ttaaattgat 4620 

ttgggtaact ttcttcccca agaaagtatt ttccccctta agtataaatc tgactg 4676 

< 210 > 21 

<211> 613 

<212> PRT 

<213> Homo sapiens 

<400> 21 


Met 

1 

Ala 

Ala 

Ala 

Met 

5 

Glu 

Thr 

Glu 

Gin 

Leu 

10 

Gly 

Val 

Glu 

lie 

Phe 

15 

Glu 

Thr 

Ala 

Asp 

Cys 

20 

Glu 

Glu 

Asn 

lie 

Glu 

25 

Ser 

Gin 

Asp 

Arg 

Pro 

30 

Lys 

Leu 

Glu 

Pro 

Phe 

35 

Tyr 

Val 

Glu 

Arg 

Tyr 

40 

Ser 

Trp 

Ser 

Gin 

Leu 

45 

Lys 

Lys 

Leu 

Leu 

Ala 

50 

Asp 

Thr 

Arg 

Lys 

Tyr 

55 

His 

Gly 

Tyr 

Met 

Met 

60 

Ala 

Lys 

Ala 

Pro 

His 

65 

Asp 

Phe 

Met 

Phe 

Val 

70 

Lys 

Arg 

Asn 

Asp 

Pro 

75 

Asp 

Gly 

Pro 

His 

Ser 

80 

Asp 

Arg 

He 

Tyr 

Tyr 

85 

Leu 

Ala 

Met 

Ser 

Gly 

90 

Glu 

Asn 

Arg 

Glu 

Asn 

95 

Thr 


25 
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Leu Phe Tyr Ser Glu lie Pro Lys Thr He Asn Arg Ala Ala Val Leu 

100 105 110 

Met Leu Ser Trp Lys Pro Leu Leu Asp Leu Phe Gin Ala Thr Leu Asp 
115 120 125 

Tyr Gly Met Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg Lys Arg 
130 135 140 

lie Gly Thr Val Gly lie Ala Ser Tyr Asp Tyr His Gin Gly Ser Gly 

145 150 155 160 

Thr Phe Leu Phe Gin Ala Gly Ser Gly lie Tyr His Val Lys Asp Gly 

165 170 175 

Gly Pro Gin Gly Phe Thr Gin Gin Pro Leu Arg Pro Asn Leu Val Glu 

180 185 190 

Thr Ser Cys Pro Asn lie Arg Met Asp Pro Lys Leu Cys Pro Ala Asp 
195 200 205 

Pro Asp Trp lie Ala Phe lie His Ser Asn Asp lie Trp lie Ser Asn 
210 215 220 

lie Val Thr Arg Glu Glu Arg Arg Leu Thr Tyr Val His Asn Glu Leu 

225 230 235 240 

Ala Asn Met Glu Glu Asp Ala Arg Ser Ala Gly Val Ala Thr Phe Val 

245 250 255 

Leu Gin Glu Glu Phe Asp Arg Tyr Ser Gly Tyr Trp Trp Cys Pro Lys 

260 265 270 

Ala Glu Thr Thr Pro Ser Gly Gly Lys lie Leu Arg lie Leu Tyr Glu 
275 280 285 

Glu Asn Asp Glu Ser Glu Val Glu lie lie His Val Thr Ser Pro Met 
290 295 300 

Leu Glu Thr Arg Arg Ala Asp Ser Phe Arg Tyr Pro Lys Thr Gly Thr 

305 310 315 320 

Ala Asn Pro Lys Val Thr Phe Lys Met Ser Glu lie Met lie Asp Ala 

325 330 335 

Glu Gly Arg lie He Asp Val lie Asp Lys Glu Leu lie Gin Pro Phe 

340 345 350 

Glu lie Leu Phe Glu Gly Val Glu Tyr lie Ala Arg Ala Gly Trp Thr 
355 360 365 

Pro Glu Gly Lys Tyr Ala Trp Ser lie Leu Leu Asp Arg Ser Gin Thr 
370 375 380 

Arg Leu Gin lie Val Leu lie Ser Pro Glu Leu Phe lie Pro Val Glu 

385 390 395 400 

Asp Asp Val Met Glu Arg Gin Arg Leu lie Glu Ser Val Pro Asp Ser 

405 410 415 

Val Thr Pro Leu lie lie Tyr Glu Glu Thr Thr Asp lie Trp lie Asn 

420 425 430 

lie His Asp lie Phe His Val Phe Pro Gin Ser His Glu Glu Glu lie 
435 440 445 

Glu Phe lie Phe Ala Ser Glu Cys Lys Thr Gly Phe Arg His Leu Tyr 
450 455 460 

Lys lie Thr Ser lie Leu Lys Glu Ser Lys Tyr Lys Arg Ser Ser Gly 

465 470 475 480 

Gly Leu Pro Ala Pro Ser Asp Phe Lys Cys Pro lie Lys Glu Glu lie 

485 490 495 

Ala lie Thr Ser Gly Glu Trp Glu Val Leu Gly Arg His Gly Ser Asn 

500 505 510 

lie Gin Val Asp Glu Val Arg Arg Leu Val Tyr Phe Glu Gly Thr Lys 
515 520 525 

Asp Ser Pro Leu Glu His His Leu Tyr Val Val Ser Tyr Val Asn Pro 
530 535 540 

Gly Glu Val Thr Arg Leu Thr Asp Arg Gly Tyr Ser His Ser Cys Cys 

545 550 555 560 

lie Ser Gin His Cys Asp Phe Phe lie Ser Lys Tyr Ser Asn Gin Lys 

565 570 575 
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Asn 

Pro 

His 

Cys 

val 

Ser 

Leu 

Tyr 

Lys 

Leu 

Ser 

Ser 

Pro 

Glu 

Asp 

Asp 




580 





585 





590 



Pro 

Thx 

Cys 

Lys 

Thr 

Lys 

Glu 

Phe 

Trp 

Ala 

Thr 

He 

Leu 

Asp 

Ser 

Val 



595 





600 





605 




Leu 

Arg 

Cys 

Ser 

Trp 













610 


< 210 > 22 

<211> 4685 

<212> DNA 

<213> Homo sapiens 

<400> 22 

aagtgctaaa gcctccgagg ccaaggccgc tgctactgcc gccgctgctt cttagtgccg 60 

cgttcgccgc ctgggttgtc accggcgccg ccgccgagga agccactgca accaggaccg 120 

gagtggaggo ggcgcagcat gaagcggcgc aggcccgctc catagcgcac gtcgggacgg 180 

tccgggcggg gccgggggga aggaaaatgc aacatggcag cagcaatgga aacagaacag 240 

ctgggtgttg agatatttga aactgcggac tgtgaggaga atattgaatc acaggatcgg 300 

cctaaattgg agccttttta tgttgagcgg tattcctgga gtcagcttaa aaagctgctt 360 

gccgatacca gaaaatatca tggctacatg atggctaagg caccacatga tttcatgttt 420 

gtgaagagga acgatccaga tggaCciuat Lcagacagaa tctattacct tgccatgtct 450 

ggtgagaaca gagaaaatac actgttttat tctgaaattc ccaaaactat caatagagca 540 

gcagtcttaa tgctctcttg gaagcctctt ttggatcttt ttcaggcaac actggactat 600 

ggaatgtatt ctcgagaaga agaactatta agagaaagaa aacgcattgg aacagtcgga 660 

attgcttctt acgattatca ccaaggaagt ggaacatttc tgtttcaagc cggtagtgga 720 

atttatcacg taaaagatgg agggccacaa ggatttacgc aacaaccttt aaggcccaat 780 

ctagtggaaa ctagttgtcc caacatacgg atggatccaa aattatgccc tgctgatcca 840 

gactggattg cttttataca tagcaacgat atttggatat ctaacatcgt aaccagagaa 900 

gaaaggagac tcacttatgt gcacaatgag ctagccaaca tggaagaaga tgccagatca 960 

gctggagtcg ctacctttgt tctccaagaa gaatttgata gatattctgg ctattggtgg 1020 

tgtccaaaag ctgaaacaac tcccagtggt ggtaaaattc ttagaattct atatgaagaa 1080 

aatgatgaat ctgaggtgga aattatteat gttacatccc ctatgttgga aacaaggagg 1140 

gcagattcat tccgttatcc taaaacaggt acagcaaatc etaaagteae ttttaagatg 1200 

tcagaaataa tgattgatgc tgaaggaagg atcatagatg tcatagataa ggaactaatt 1260 

caaccttttg agattetatt tgaaggagtt gaatatattg ccagagctgg atggactcct 1320 

gagggaaaat atgcttggtc catcctacta gatcgctccc agactcgcct acagatagtg 1380 

ttgatctcac ctgaattatt tatcccagta gaagatgatg ttatggaaag gcagagactc 1440 

attgagteag tgcctgattc tgtgacgcca etaattatet atgaagaaac aacagacatc 1500 

tggataaata tccatgacat ctttcatgtt tttccccaaa gteaegaaga ggaaattgag 1560 

tttatttttg cctctgaatg caaaacaggt ttccgtcatt tatacaaaat tacatctatt 1620 

ttaaaggaaa gcaaatataa acgatccagt ggtgggctgc ctgctccaag tgatttcaag 1680 

tgtcctatca aagaggagat agcaattacc agtggtgaat gggaagttct tggeeggeat 1740 

ggatetaata tccaagttga tgaagteaga aggctggtat attttgaagg caccaaagac 1800 

tcccctttag agcatcacct gtacgtagtc agttacgtaa atcctggaga ggtgacaagg 1860 

ctgactgacc gtggctactc acattcttgc tgcatcagtc agcactgtga ettetttata 1920 

agtaagtata gtaaccagaa gaatccacac tgtgtgtccc tttacaagct atcaagtcct 1980 

gaagatgacc caacttgcaa aacaaaggaa ttttgggcca ccattttgga ttcagtectc 2040 

aggtgcagtt ggtgaataat cggtttaaag gagtcaagta tttccgcttg aataccctag 2100 

cctctctagg ttatgtggtt gtagtgatag acaacagggg atcctgtcac egagggetta 2160 

aatttgaagg cgcctttaaa tataaaatgg gtcaaataga aattgaegat caggtggaag 2220 

gactccaata tetagettet egatatgatt teattgaett agatcgtgtg ggcatccacg 2280 

gctggtccta tggaggatac ctctccctga tggcattaat gcagaggtca gatatettea 2340 

gggttgctat tgctggggcc ccagtcactc tgtggatctt ctatgataca ggatacacgg 2400 

aacgttatat gggtcaccct gaccagaatg aacagggcta ttacttagga tctgtggcca 2460 

tgcaagcaga aaagttcccc tctgaaccaa atcgtttact gctcttacat ggtttcctgg 2520 

atgagaatgt ccattttgca cataccagta tattactgag ttttttagtg agggctggaa 2580 

agccatatga tttacagatc tatcctcagg agagacacag cataagagtt cctgaatcgg 2640 

gagaacatta tgaactgcat cttttgcact accttcaaga aaaccttgga teaegtattg 2700 

ctgctctaaa agtgatataa ttttgacctg tgtagaactc tctggtatac actggctatt 2760 

taaccaaatg aggaggttta atcaacagaa aacacagaat tgatcatcac attttgatac 2820 
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ctgccatgta acatctactc ctgaaaataa atgtggtgcc atgcaggggt ctacggtttg 2880 

tggtagtaat ctaatacctt aaccccacat gctcaaaatc aaatgataca tattcctgag 2940 

agacccagca ataccataag aattactaaa aaaaaaaaaa aaaaaaagac attagcacca 3000 

tgtattcata ctaccctatt ttcactttta atagtattat aaacttcatg aacttaatta 3060 

gtgtattttt acagtatact tttgagtttg ttaaaatatg atgatattag tgattggttt 3120 

ggttcagttc cagaatcttt gactagttac agatttgata gcacttaaat gtaattgaat 3180 

agcttatgct tcattgcttg ggcatatcca gcatgttatg aactaataac tattaaactt 3240 

gacttaacca gtcattcatt aataattttt caaggataac ttagtggcct cctaaagaca 3300 

cttgttttgg cactgaccag tttttagcca atttaatctg tatctagtat aaataattct 3360 

catttttctt tgatgatatt aacagagtgg gcttttcctt ttgcataaag gctagtaact 3420 

gtatatgtag catggattta attagtcatg atattgataa ttacaggcag aaaattttta 3480 

atcaaatgat tagagcttaa atatttgcag gcaagttttt ttttttcctt taagaaaagg 3540 

aaaaagtaca cattcactag aattcttcag aaaatttagt ggtgccagtt tccatttggt 3600 

atttccttat taaaatattc tagaatttta aggagattga agggaatcac agtggggtgg 3660 

ggagacctgg gtttggggaa tgacagagag aagaggtggt gagggcctga ttaaaaacta 3720 

agcagaagta gttttaacaa aaatactcat gaaaatgttt ggaaactgaa atttaaacaa 3780 

ctgtaatatt aaggaaacca gaatcaataa atcactgtct tgccagcaca gctacagagt 3840 

aacatgattc aggggaggaa aagttcctta gagttacttt tataattctt tttttttttc 3900 

ctcttaggtt tagaaatctt acaaatttaa actttatcct tttaaaatta tttgaacata 3960 

atttagatat tgtaagctta aaatacaaat gtttatagat aacctcttta ccataaacta 4020 

atccctggca agccatggct ctcttttttt ttttggtgtt taaagcctgt aaacagtttt 4080 

tctgaatgat catgaacttt tcttggttta gcactaggat ttagctatga agagagctca 4140 

taggctttca ggtgctaatt gagatctgcc ctgttagagt cttggggtgc tagattggtc 4200 

acattgacac cagtggcagg gaaggcatct atgagtttga tgctttttat cacacacttc 4260 

agtgtttaga aagttattac caatactttt aaacaacact ccaagaaaat ttgctatatt 4320 

tctttctcat cactacagag agagtagatt tccccataga gagcacagcc tccattagta 4380 

aggttggtga ctattggtaa gaggtggact tcattgacac caagtgggag gtagggaaag 4440 

cccagaaatg gcaggatgat atggtggttc tgtcgttggg aaaggtattg ggttttgctg 4500 

tttgtattta tactgtataa tagataccac gctttttctt attatctgta tatgtattgc 4560 

ttttcatgtt tgatattttc ccatgccaag atttgtttat atatattttc aatgttaaat 4620 

taaattgatt tgggtaactt tcttccccaa gaaagtattt tcccccttaa gtataaatct 4680 

gactg 4685 
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645 650 

Pro His Ala Leu Gin Pro Gly Lys Lys His Pro Thr Val 

660 665 

Tyr Gly Gly Pro Gin Val Gin Leu Val Asn Asn Ser Phe 
675 680 685 

Lys Tyr Leu Arg Leu Asn Thr Leu Ala Ser Leu Gly Tyr 
690 695 700 

Val lie Asp Gly Arg Gly Ser Cys Gin Arg Gly Leu Arg 
705 710 715 

Ala Leu Lys Asn Gin Met Gly Gin Val Glu lie Glu Asp 

725 730 

Gly Leu Gin Phe Val Ala Glu Lys Tyr Gly Phe lie Asp 

740 745 

Val Ala lie His Gly Trp Ser Tyr Gly Gly Phe Leu Ser 
755 760 765 

Leu He His Lys Pro Gin Val Phe Lys Val Ala lie Ala 
770 775 780 

Val Thr Val Trp Met Ala Tyr Asp Thr Gly Tyr Thr Glu 
785 790 795 

Asp Val Pro Glu Asn Asn Gin His Gly Tyr Glu Ala Gly 

80 b mo 

Leu His Val Glu Lys Leu Pro Asn Glu Pro Asn Arg Leu 

820 825 

His Gly Phe Leu Asp Glu Asn Val His Phe Phe His Thr 
835 840 845 

Val Ser Gin Leu lie Arg Ala Gly Lys Pro Tyr Gin Leu 
850 855 860 

Pro Asn Glu Arg His Ser lie Arg Cys Pro Glu Ser Gly 
865 870 875 

Glu Val Thr Leu Leu His Phe Leu Gin Glu Tyr Leu 

885 890 
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caggccgccg cctgggtcgc tcaacttccg ggtcaaaggt gcctgagccg gcgggtcccc 60 
tgtgtccgcc gcggctgtcg tcccccgctc ccgccacttc cggggtcgca gtcccgggca 120 
tggagccgcg accgtgaggc gccgctggac ccgggacgac ctgcccagtc cggccgccgc 180 
cccacgtccc ggtctgtgtc ccacgcctgc agctggaatg gaggctctct ggacccttta 240 
gaaggcaccc ctgccctcct gaggtcagct gagcggttaa tgcggaaggt taagaaactg 300 
cgcctggaca aggagaacac cggaagttgg agaagcttct cgctgaattc cgagggggct 360 
gagaggatgg ccaccaccgg gaccccaacg gccgaccgag gcgacgcagc cgccacagat 420 
gacccggccg cccgcttcca ggtgcagaag cactcgtggg acgggctccg gagcatcatc 480 
cacggcagcc gcaagtactc gggcctcatt gtcaacaagg cgccccacga cttccagttt 540 
gtgcagaaga cggatgagtc tgggccccac tcccaccgcc tctactacct gggaatgcca 600 
tatggcagcc gagagaactc cctcctctac tctgagattc ccaagaaggt ccggaaagag 660 
gctctgctgc tcctgtcctg gaagcagatg ctggatcatt tccaggccac gccccaccat 720 
ggggtctact ctcgggagga ggagctgctg agggagcgga aacgcctggg ggtcttcggc 780 
atcacctcct acgacttcca cagcgagagt ggcctcttcc tcttccaggc cagcaacagc 840 
ctcttccact gccgcgacgg cggcaagaac ggcttcatgg tgtcccctat gaaaccgctg 900 
gaaatcaaga cccagtgctc agggccccgg atggacccca aaatctgccc tgccgaccct 960 
gccttcttct ccttcatcaa taacagcgac ctgtgggtgg ccaacatcga gacaggcgag 1020 
gagcggcggc tgaccttctg ccaccaaggt ttatccaatg tcctggatga ccccaagtct 1080 
gcgggtgtgg ccaccttcgt catacaggaa gagttcgacc gcttcactgg gtactggtgg 1140 
tgccccacag cctcctggga aggttcagag ggcctcaaga cgctgcgaat cctgtatgag 1200 
gaagtcgatg agtccgaggt ggaggtcatt cacgtcccct ctcctgcgct agaagaaagg 1260 
aagacggact cgtatcggta ccccaggaca ggcagcaaga atcccaagat tgccttgaaa 1320 
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ctggctgagt 

gtgcagccct 

cgggatggca 

gtcctcctcc 

tctgccagag 

gtctggatca 

ctctgctttc 

gccgttttaa 

tttaagtgcc 

aggcacggct 

aaggacacgc 

gtacgcctca 

ttcgtcagcc 

ggccccgacg 

gccagctgcc 

gtgcggctct 
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caggtggaga 
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tccagactga 

tcagctcgct 

aatacgcctg 

ccccggccct 

ctgtccccag 

atgttcatga 

tccgcgccaa 

aatcccaggg 

ccattaagga 

ccaagatctg 

cgctggagca 

ccacgcccgg 

actacagcag 

acgaccccct 

ccccggatta 

acggcatgat 

ttgtatatgg 

tgcggctcaa 

cctgtcagcg 

tcgaggacca 

gccgagttgc 

acaagcccca 

acgacacagg 

aggcgggttc 

tcctccacgg 

aactgatccg 

ttcgctgccc 

acctctgagc 

ggggaaccag 

ccagcgctgg 

tttaaacgct 

gtctcgggcc 

cactggagag 

gacctctcag 

tgtccccccc 

cataattatt 

ggaggctgag 

gggaaacccc 

taatcccagc 

tgcggtgagc 

ctcaaaataa 

tccacaccgg 

cgagcagagg 

ctgtggacag 

acgctctgct 

ctgcctgacc 

ggttatattt 

aaaaagtaaa 

aagaaaatat 

ttaaacgatt 


cagccagggc 

gttcccgaag 

ggccatgttc 

gttcatcccg 

gaatgtccag 

catcttctat 

tgaatgcaag 

ctacgattgg 

agagattgct 

ggtcaatgag 

ccacctctac 

cttctcccat 

cgtgagcacg 

gcacaagcag 

tgttcctcca 

ctacaagccc 

aggcccccag 

cacactggcc 

agggcttcgg 

ggtggagggc 

catccatggc 

ggtgttcaag 

gtacactgag 

cgtggccctg 

cttcctggac 

agcagggaaa 

cgagtcgggc 

ctgcccaccg 

gcgggaggga 

ccagccccga 

cttgggtttt 

agcccctcct 

gccagaagag 

tcccctggcc 

ggccagcctc 

taaaagacag 
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ataaaaaata 

gcagcggggt 

gtcatgcggg 

ccctcctgtc 

ggtcagtggc 

cgtacacacc 

ccctctcccc 

aagaaaagaa 

ttttgtcgat 

ccagttggaa 


aagatcgtct 

gtggagtaca 

ctggaccggc 

agcacagaga 

ccgtatgtgg 

cccttccccc 

accggcttct 

agtgagccct 

ctgaccagcg 

gagaccaagc 

gtggtcagct 
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ccccgcttct 

gagatcttcc 
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gtgcagctgg 

tccctgggct 

ttcgaagggg 

ctgcagttcg 

tggtectacg 

gtggccatcg 

cgctacatgg 

cacgtggaga 

gaaaacgtgc 

ccttaccagc 

gagcactatg 

ggagccgcca 

ctgagtggcc 

ggagccgctg 

atgtccgctg 

ctccccgcct 

actcagagga 

cggccagcca 

cccaacttga 

gccgggcgcg 

cacctgaggt 

aaaatacaaa 

gctgaggcag 

ccattgcact 

aaagacagaa 

tgcaacccag 

ccccacagga 

cccaagcttt 

ggctgtcccc 

aggggttccg 

ttccctcccc 

aaaaaaaaaa 

tcttattctt 

acatgtcacc 


cgacccagga 

tcgccagggc 

cccagcagtg 

atgaggagca 

tgtacgagga 

aatcagaggg 

gccatttgta 

tcagccccgg 

gtgaatggga 

tggtgtactt 

atgaggcggc 

tgagccagaa 

tgcacgtcta 

gggctageat 

atttccacac 

agccagggaa 

tgaataactc 

acgccgtggt 

ccctgaaaaa 

tggccgagaa 

-— bM.t. 

y y y y 

cgggtgcccc 

acgtccctga 

agctgcccaa 

actttttcca 

tccagatcta 

aagtcacgtt 

catcacagca 

cgcgggcccc 

ccttcaccgc 

cttcttggtt 

tctgggagga 

gcgggctgcc 

ccgtccccag 

tgtttgtgtt 

gtggctcacg 

tgggagttca 

aaattagccg 

gagaatcgct 

ccagcctggg 

agcaaggggt 

cacctggtag 

gaagcggcca 

caggcaggca 

tccccagccc 

gggttgggag 

gccaagagct 

aagaaacaaa 

ttataattat 

tg 


gaaggagctg 

cgggtggacc 

gctccagctc 

gcggctagcc 

ggtcaccaac 

agaggacgag 

caaagtcacc 

ggaagatgaa 

ggttttggcg 

ccagggcacc 

cggcgagatc 

cttcgacatg 

caagctgagc 

gatggaggca 

gcgctcggat 

gaagcacccc 

cttcaaaggc 

tgtgattgac 

ccaaatgggc 

gtatggcttc 

M ^ M ^ A 

ggtcaccgtc 

gaacaaccag 

tgagcccaac 

cacaaacttc 

ccccaacgag 

gctgcacttt 

caagtggctg 

agtgaggcac 

cccgacgcct 

gccgagacag 

ggaggtcaca 

ttccgcctgg 

cacccaagca 

ttgtttgggg 

tctgtaatcc 

agaccagcct 

ggtgtggtgg 

tgaacccggg 

caacaagagc 

gcctaaatct 

gctccatttc 

gggcccgcgg 

ctgaaacgca 

agccgcccag 

ctgaaccatc 

ctgccagggg 

ccacctctac 

gcgtggaaga 


<210> 25 

<211> 518 

<212> PRT 

<213> Homo sapiens 

<400> 25 

Met Arg Lys Val Lys Lys Leu Arg Leu Asp Lys Glu Asn Thr Gly Ser 
15 10 15 

Trp Arg Ser Phe Ser Leu Asn Ser Glu Gly Ala Glu Arg Met Ala Thr 


1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

A C O A 
W 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4302 
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20 25 30 

Thr Gly Thr Pro Thr Ala Asp Arg Gly Asp Ala Ala Ala Thr Asp Asp 
35 40 45 

Pro Ala Ala Arg Phe Gin Val Gin Lys His Ser Trp Asp Gly Leu Arg 
50 55 60 

Ser lie lie His Gly Ser Arg Lys Tyr Ser Gly Leu lie Val Asn Lys 

65 70 75 80 

Ala Pro His Asp Phe Gin Phe Val Gin Lys Thr Asp Glu Ser Gly Pro 

85 90 95 

His Ser His Arg Leu Tyr Tyr Leu Gly Met Pro Tyr Gly Ser Arg Glu 

100 105 110 

Asn Ser Leu Leu Tyr Ser Glu lie Pro Lys Lys val Arg Lys Glu Ala 
115 120 125 

Leu Leu Leu Leu Ser Trp Lys Gin Met Leu Asp His Phe Gin Ala Thr 
130 135 140 

Pro His His Gly Val Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg 

145 150 155 160 

Lys Arg Leu Gly Val Phe Gly lie Thr Ser Tyr Asp Phe His Ser Glu 

165 170 175 

Ser Gly Leu Phe Leu Phe Gin Ala Ser Asn Ser Leu Phe His Cys Arg 

180 185 190 

Asp Gly Gly Lys Asn Gly Phe Met Val Ser Pro Met Lys Pro Leu Glu 
195 200 205 

lie Lys Thr Gin Cys Ser Gly Pro Arg Met Asp Pro Lys lie Cys Pro 
210 215 220 

Ala Asp Pro Ala Phe Phe Ser Phe lie Asn Asn Ser Asp Leu Trp Val 

225 230 235 240 

Ala Asn lie Glu Thr Gly Glu Glu Arg Arg Leu Thr Phe Cys His Gin 

245 250 255 

Gly Leu Ser Asn Val Leu Asp Asp Pro Lys Ser Ala Gly Val Ala Thr 

260 265 270 

Phe Val lie Gin Glu Glu Phe Asp Arg Phe Thr Gly Tyr Trp Trp Cys 
275 280 285 

Pro Thr Ala Ser Trp Glu Gly Ser Glu Gly Leu Lys Thr Leu Arg He 
290 295 300 

Leu Tyr Glu Glu Val Asp Glu Ser Glu Val Glu Val lie His Val Pro 

305 310 315 320 

Ser Pro Ala Leu Glu Glu Arg Lys Thr Asp Ser Tyr Arg Tyr Pro Arg 

325 330 335 

Thr Gly Ser Lys Asn Pro Lys lie Ala Leu Lys Leu Ala Glu Phe Gin 

340 345 350 

Thr Asp Ser Gin Gly Lys lie Val Ser Thr Gin Glu Lys Glu Leu Val 
355 360 365 

Gin Pro Phe Ser Ser Leu Phe Pro Lys Val Glu Tyr lie Ala Arg Ala 
370 375 380 

Gly Trp Thr Arg Asp Gly Lys Tyr Ala Trp Ala Met Phe Leu Asp Arg 

385 390 395 400 

Pro Gin Gin Trp Leu Gin Leu Val Leu Leu Pro Pro Ala Leu Phe lie 

405 410 415 

Pro Ser Thr Glu Asn Glu Glu Gin Arg Leu Ala Ser Ala Arg Ala Val 

420 425 430 

Pro Arg Asn Val Gin Pro Tyr Val Val Tyr Glu Glu Val Thr Asn Val 
435 440 445 

Trp lie Asn Val His Asp lie Phe Tyr Pro Phe Pro Gin Ser Glu Gly 
450 455 460 

Glu Asp Glu Leu Cys Phe Leu Arg Ala Asn Glu Cys Lys Thr Gly Phe 

465 470 475 480 

Cys His Leu Tyr Lys Val Thr Ala Val Leu Lys Ser Gin Gly Tyr Asp 

485 490 495 

Trp Ser Glu Pro Phe Ser Pro Gly Glu Gly Glu Gin Ser Leu Thr Asn 


32 


WO 02/31134 


PCT/US01/31874 


500 

Ala Val Asp Ser Ser Arg 
515 


505 


510 


<210> 26 

<211> 2411 

<212> DNA 

<213> Homo sapiens 

<400> 26 


caggccgccg 

tgtgtccgcc 

tggagccgcg 

cccacgtccc 

gaaggcaccc 

cgcctggaca 

gagaggatgg 

gacccggccg 

cacggcagcc 

gtgcaqaaqa 

tatggcagcc 

gctctgctgc 

ggggtctact 

atcacctcct 

ctcttccact 

gaaatcaaga 

gccttcttct 

gagcggcggc 

gcgggtgtgg 

tgccccacag 

gaagtcgatg 

aagacggact 

ctggctgagt 

gtgcagccct 

cgggatggoa 

gtcctcctcc 

tctgccagag 

gtctggatca 

ctctgctttc 

gccgttttaa 

cagagcctga 

tgttcattgg 

ggcaggaagc 

gttggcttat 

accaggggtt 

ataaagtttt 

tttgtgctgc 

ttactgtcta 

attgttcact 

aggcagtatt 

agatcctagt 


cctgggtcgc 

gcggctgtcg 

accgtgaggc 

ggtctgtgtc 

ctgccctcct 

aggagaacac 

ccaccaccgg 

cccgcttcca 

gcaagtactc 

cggatgagtc 

gagagaactc 

tcctgtcctg 

ctcgggagga 

acgacttcca 

gccgcgacgg 

cccagtgctc 

ccttcatcaa 

tgaccttctg 

ccaccttcgt 

cctcctggga 

agtccgaggt 

cgtatcggta 

tccagactga 

tcagctcgct 

aatacgcctg 

ccccggccct 

ctgtccccag 

atgttcatga 

tccgcgccaa 

aatcccaggg 

cgaatgctgt 

tcggcccccc 

cagccttccc 

tcaggcagca 

ggccacctac 

attggcactg 

agcagcagaa 

gccctttgta 

gtaaataaag 

ccagaaacag 

c 


tcaacttccg 

tcccccgctc 

gccgctggac 

ccacgcctgc 

gaggtcagct 

cggaagttgg 

gaccccaacg 

ggtgcagaag 

gggcctcatt 

tgggccccac 

cctcctctac 

gaagcagatg 

ggagctgctg 

cagcgagagt 

cggcaagaac 

agggccocgg 

taacagcgac 

ccaccaaggt 

catacaggaa 

aggttcagag 

ggaggtcatt 

ccccaggaca 

cagccagggc 

gttcccgaag 

ggccatgttc 

gttcatcccg 

gaatgtccag 

catcttctat 

tgaatgcaag 

ctacgattgg 

cgactcatcg 

cactcagcca 

cagtgactgc 

gatttactga 

ggcccacagg 

agccacagcc 

ctgggtagtc 

gaaacatttg 

aaaaaggaaa 

ttagaggtgc 


ggtcaaaggt 

ccgccacttc 

ccgggaogao 

agctggaatg 

gagcggttaa 

agaagcttct 

gccgaccgag 

cactcgtggg 

gtcaacaagg 

tcccaccgcc 

tctgagattc 

ctggatcatt 

agggagcgga 

ggcctcttcc 

ggcttcatgg 

atggacccca 

ctgtgggtgg 

ttatccaatg 

gagttcgacc 

ggcctcaaga 

cacgtcccct 

ggcagcaaga 

aagatcgtct 

gtggagtaoa 

ctggaccggc 

agcacagaga 

ccgtatgtgg 

cccttccccc 

accggcttct 

agtgagccct 

cgttagtcac 

gcacaccctg 

atgatctggc 

gcacctcccc 

tcaaatccgg 

acttgtttac 

ccagcagaaa 

ccagctcctg 

atccctgctc 

tgcctctggt 


gcctgagccg 

cggggtcgca 

ctgcccagtc 

gaggctctct 

tgcggaaggt 

cgctgaattc 

gcgacgcagc 

acgggctccg 

cgccccacga 

tctactacct 

ccaagaaggt 

tccaggccac 

aacgcctggg 

tcttccaggc 

tgtcccctat 

aaatctgccc 

ccaacatcga 

tcctggatga 

gcttcactgg 

cgctgcgaat 

ctcctgcgct 

atcccaagat 

cgacccagga 

tcgccagggc 

cccagcagtg 

atgaggagca 

tgtacgagga 

aatcagaggg 

gccatttgta 

tcagccccgg 

gtgtggttca 

cgggagaagg 

agggcttaga 

tgtgccaggc 

cccaccacct 

agagactgtc 

ctgttgtgca 

ctgtaggtag 

ttgggacctt 

gtgctgtggg 


gcgggtcccc 60 
gtcccgggca 120 
cggccgccgc 180 
ggacccttta 240 
taagaaactg 300 
cgagggggct 360 
cgccacagat 420 
gagcatcatc 480 
cttccagttt 540 
gggaatgcca 600 
ccggaaagag 660 
gccccaccat 720 
ggtcttcggc 780 
cagcaacagc 840 
gaaaccgctg 900 
tgccgaccct 960 
gacaggcgag 1020 
ccccaagtct 1080 
gtactggtgg 1140 
cctgtatgag 1200 
agaagaaagg 1260 
tgccttgaaa 1320 
gaaggagctg 1380 
cgggtggacc 1440 
gctccagctc 1500 
gcggctagcc 1560 
ggtcaccaac 1620 
agaggacgag 1680 
caaagtcacc 1740 
ggaaggtgag 1800 
atatgctgtt 1860 
aacagggatc 1920 
gcacccaact 1980 
ccttagcaca 2040 
gtgttcataa 2100 
tgtggtcgct 2160 
aggccaagat 2220 
ctgtgatgga 2280 
ctagtggagg 2340 
tggcagatgc 2400 

2411 


<210> 

27 

<211> 

892 

<212> 

PRT 

<213 > 

Homo 

<4 00 > 

27 


Met Arg Lys Val Lys Lys Leu Arg Leu Asp Lys Glu Asn Thr Gly Ser 
]_ 5 10 IS 
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Trp Arg Ser Phe Ser Leu Asn Ser Glu Gly Ala Glu Arg Met Ala Thr 

20 25 30 

Thr Gly Thr Pro Thr Ala Asp Arg Gly Asp Ala Ala Ala Thr Asp Asp 
35 40 45 

Pro Ala Ala Arg Phe Gin Val Gin Lys His Ser Trp Asp Gly Leu Arg 
50 55 60 

Ser He lie His Gly Ser Arg Lys Tyr Ser Gly Leu lie Val Asn Lys 

65 70 75 80 

Ala Pro His Asp Phe Gin Phe Val Gin Lys Thr Asp Glu Ser Gly Pro 

85 90 95 

His Ser His Arg Leu Tyr Tyr Leu Gly Met Pro Tyr Gly Ser Arg Glu 

100 105 110 

Asn Ser Leu Leu Tyr Ser Glu lie Pro Lys Lys Val Arg Lys Glu Ala 

lis 120 125 

Leu Leu Leu Leu Ser Trp Lys Gin Met Leu Asp His Phe Gin Ala Thr 
130 135 140 

Pro His His Gly val Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg 

145 150 155 160 

Lys Arg Leu Gly Val Phe Gly lie Thr Ser Tyr Asp Phe His Ser Glu 

165 170 175 

fiftr G1 v TjPH Phe T.eu Phe filn ala Car &an Cai- t.ah __*_ 

- - - - — - -----— — -W*- — — — -O'- 

180 185 190 

Asp Gly Gly Lys Asn Gly Phe Met Val Ser Pro Met Lys Pro Leu Glu 
195 200 205 

lie Lys Thr Gin Cys Ser Gly Pro Arg Met Asp Pro Lys lie Cys Pro 
210 215 220 

Ala Asp Pro Ala phe Phe Ser Phe lie Asn Asn Ser Asp Leu Trp Val 

225 230 235 240 

Ala Asn lie Glu Thr Gly Glu Glu Arg Arg Leu Thr Phe Cys His Gin 

245 250 255 

Gly Leu Ser Asn Val Leu Asp Asp Pro Lys Ser Ala Gly val Ala Thr 

260 265 270 

Phe Val lie Gin Glu Glu Phe Asp Arg Phe Thr Gly Tyr Trp Trp Cys 
275 280 285 

Pro Thr Ala Ser Trp Glu Gly Ser Glu Gly Leu Lys Thr Leu Arg lie 
290 295 300 

Leu Tyr Glu Glu Val Asp Glu Ser Glu Val Glu Val lie His Val Pro 

305 310 315 320 

Ser Pro Ala Leu Glu Glu Arg Lys Thr Asp Ser Tyr Arg Tyr Pro Arg 

325 330 335 

Thr Gly Ser Lys Asn Pro Lys lie Ala Leu Lys Leu Ala Glu Phe Gin 

340 345 350 

Thr Asp Ser Gin Gly Lys lie Val Ser Thr Gin Glu Lys Glu Leu Val 
355 360 365 

Gin Pro Phe Ser ser Leu Phe Pro Lys Val Glu Tyr lie Ala Arg Ala 
370 375 380 

Gly Trp Thr Arg Asp Gly Lys Tyr Ala Trp Ala Met Phe Leu Asp Arg 

385 390 395 400 

Pro Gin Gin Trp Leu Gin Leu Val Leu Leu Pro Pro Ala Leu Phe lie 

405 410 415 

Pro Ser Thr Glu Asn Glu Glu Gin Arg Leu Ala Ser Ala Arg Ala Val 

420 425 430 

Pro Arg Asn Val Gin Pro Tyr Val Val Tyr Glu Glu Val Thr Asn Val 
435 440 445 

Trp lie Asn Val His Asp lie Phe Tyr Pro Phe Pro Gin Ser Glu Gly 
450 455 460 

Glu Asp Glu Leu Cys Phe Leu Arg Ala Asn Glu Cys Lys Thr Gly Phe 

465 470 475 480 

Cys His Leu Tyr Lys Val Thr Ala Val Leu Lys Ser Gin Gly Tyr Asp 

485 490 495 
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Ser 

Glu 

Pro 

500 

Phe 

Ser 

Pro 
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Glu 

515 

lie 

Ala 

Leu 

Thr 

Ser 

520 

His 

Gly 

530 

Ser 

Lys 

lie 
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Val 

535 

Asn 

Gin 

545 

Gly 

Thr 

Lys 
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Thr 

550 

Pro 

Leu 

Tyr 

Glu 

Ala 

Ala 

Gly 

565 

Glu 
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Val 

His 

Ser 

Cys 

Ser 

580 

Met 

Ser 

Gin 
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Ser 

Ser 

Val 

595 

Ser 

Thr 

Pro 

Pro 

Cys 

600 

Pro 

Asp 

610 

Asp 

Asp 

Pro 

Leu 

His 

615 

Lys 

Met 

625 

Glu 

Ala 

Ala 

Ser 

Cys 

630 

Pro 

Pro 

His 

Phe 

His 

Thr 

Arg 

645 

Ser 

Asp 

Val 

Pro 

His 

Ala 

Leu 

660 

Gin 

Pro 

Gly 

Lys 

Tyr 

Gly 

Gly 

675 

Pro 

Gin 

Val 

Gin 

Leu 

680 

Lys 

Tyr 

690 

Leu 

Arg 

Leu 

Asn 

Thr 

695 

Leu 

val 

705 

He 

Asp 

Gly 

Arg 

Gly 

710 

Ser 

Cys 

Ala 

Leu 

Lys 

Asn 

Gin 

725 

Met 

Gly 

Gin 

Gly 

Leu 

Gin 

Phe 

740 

Val 

Ala 

Glu 

Lys 

val 

Ala 

lie 

755 

His 

Gly 

Trp 

Ser 

Tyr 

760 

Leu 

lie 

770 

His 

Lys 

Pro 

Gin 

Val 

775 

Phe 

val 

785 

Thr 

Val 

Trp 

Met 

Ala 

790 

Tyr 
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Val 

Pro 

Glu 

Asn 

805 

Asn 

Gin 

His 
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His 

Val 

Glu 

820 

Lys 

Leu 

Pro 

Asn 

His 
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Asp 

GlU 

Asn 

Val 

840 

Val 
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850 

Gin 
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Arg 

Ala 

855 

Gly 
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GlU 

505 

Asp 

Glu 

Phe 

Lys 

Cys 

510 

Pro 

lie 

Gly 

Glu 

Trp 

Glu 

Val 

525 

Leu 

Ala 

Arg 

Glu 

Glu 

Thr 

Lys 

540 

lieu 

Val 

Tyr 

Phe 

Glu 

His 

His 

555 

Leu 

Tyr 

Val 

Val 

Ser 

560 

Arg 

Leu 

570 

Thr 

Thr 

Pro 

Gly 

Phe 

575 

Ser 

Phe 

585 

Asp 

Met 

Phe 

Val 

Ser 

590 

His 

Tyr 

Val 

His 

Val 

Tyr 

Lys 

605 

Leu 

Ser 

Gly 

Gin 

Pro 

Arg 

Phe 

620 

Trp 

Ala 

Ser 

Met 

Asp 

Tyr 

Val 

635 

Pro 

Pro 

Glu 

lie 

Phe 

640 

Arg 

Leu 

650 

Tyr 

Gly 

Met 

lie 

Tyr 

655 

Lys 


His 


Thx 


^ _ * _ 

T\l. — 


jjys 

665 

ftO 

vai 

ueu 

670 

true 

vai 

Val 

Asn 

Asn 

Ser 

Phe 

685 

Lys 

Gly 

lie 

Ala 

Ser 

Leu 

Gly 

700 

Tyr 

Ala 

Val 

Val 

Gin 

Arg 

Gly 

715 

Leu 

Arg 

Phe 

Glu 

Gly 

720 

Val 

Glu 

730 

lie 

Glu 

Asp 

Gin 

Val 

735 

Glu 

Tyr Gly 
745 

Phe 

lie 

Asp 

Leu 

750 

Ser 

Arg 

Gly Gly 

Phe 

Leu 

Ser 

765 

Leu 

Met 

Gly 

Lys 

Val 

Ala 

lie 

780 

Ala 

Gly 

Ala 

Pro 

Thr 

Gly 

Tyr 

795 

Thr 

Glu 

Arg 

Tyr 

Met 

800 

Gly Tyr 
810 

Glu 

Ala 

Gly 

Ser 

val 

815 

Ala 

GlU 

825 

Pro 

Asn 

Arg 

Leu 

Leu 

830 

lie 

Leu 

His 

Phe 

Phe 

His 

Thr 

845 

Asn 

Phe 

Leu 

Lys 

Pro 

Tyr 

Gin 

860 

Leu 

Gin 

lie 

Tyr 
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Pro Asn Glu Arg His Ser He Arg Cys Pro Glu Ser Gly Glu His Tyr 
865 870 875 880 

Glu Val Thr Leu Leu His Phe Leu Gin Glu Tyr Leu 

885 890 

<210 > 28 

<211> 4219 

<212 > DNA 

<213 > Homo sapiens 

<400 > 28 

caggccgccg cctgggtcgc tcaacttccg ggtcaaaggt gcctgagccg gcgggtcccc 60 

tgtgtccgcc gcggctgtcg tcccccgctc ccgccacttc cggggtcgca gtcccgggca 120 

tgg a 9 cc 9 c 9 accgtgaggc gccgctggac ccgggacgac ctgcccagtc cggccgccgc 180 

cccacgtccc ggtctgtgtc ccacgcctgc agctggaatg gaggctctct ggacccttta 240 

gaaggcaccc ctgccctcct gaggtcagct gagcggttaa tgcggaaggt taagaaactg 300 

cgcctggaca aggagaacac cggaagttgg agaagcttct cgctgaattc cgagggggct 360 

gagaggatgg ccaccaccgg gaccccaacg gccgaccgag gcgacgcagc cgccacagat 420 

gacccggccg cccgcttcca ggtgcagaag cactcgtggg acgggctccg gagcatcatc 480 

cacggcagcc gcaagtactc gggcctcatt gtcaacaagg cgccccacga cttccagttt 540 

gtgcagaaga cggatgagtc tgggccccac tcccaccgcc tctactacct gggaatgcca 600 

tatggcagcc gagagaactc cctcctctac tctgagattc ccaagaaggt ccggaaagag 660 

gctctgctgc tcctgtcctg gaagcagatg ctggatcatt tccaggccac gccccaccat 720 

ggggtctact ctcgggagga ggagctgctg agggagcgga aacgcctggg ggtcttcggc 780 

atcacctcct acgacttcca cagcgagagt ggcctcttcc tcttccaggc cagcaacagc 840 

ctcttccact gccgcgacgg cggcaagaac ggcttcatgg tgtcccctat gaaaccgctg 900 

gaaatcaaga cccagtgctc agggccccgg atggacccca aaatctgccc tgccgaccct 960 

gccttcttct ccttcatcaa taacagcgac ctgtgggtgg ccaacatcga gacaggcgag 1020 

gagcggcggc tgaccttctg ccaccaaggt ttatccaatg tcctggatga ccccaagtct 1080 

gcgggtgtgg ccaccttcgt catacaggaa gagttcgacc gcttcactgg gtactggtgg 1140 

tgccccacag cctcctggga aggttcagag ggcctcaaga cgctgcgaat cctgtatgag 1200 

gaagtcgatg agtccgaggt ggaggtcatt cacgtcccct ctcctgcgct agaagaaagg 1260 

aagacggact cgtatcggta ccccaggaca ggcagcaaga atcccaagat tgccttgaaa 1320 

ctggctgagt tccagactga cagccagggc aagatcgtct cgacccagga gaaggagctg 1380 

gtgcagccct tcagctcgct gttcccgaag gtggagtaca tcgccagggc cgggtggacc 1440 

c 999 at 99 ca aatacgcctg ggccatgttc ctggaccggc cccagcagtg gctccagctc 1500 

gtcctcctcc ccccggccct gttcatcccg agcacagaga atgaggagca gcggctagcc 1560 

tctgccagag ctgtccccag gaatgtccag ccgtatgtgg tgtacgagga ggtcaccaac 1620 

gtctggatca atgttcatga catcttctat cccttccccc aatcagaggg agaggacgag 1680 

ctctgctttc tccgcgccaa tgaatgcaag accggcttct gccatttgta caaagtcacc 1740 

gccgttttaa aatcccaggg ctacgattgg agtgagccct tcagccccgg ggaagatgaa 1800 

tttaagtgcc ccattaagga agagattgct ctgaccagcg gtgaatggga ggttttggcg 1860 

aggcacggct ccaagatctg ggtcaatgag gagaccaagc tggtgtactt ccagggcacc 1920 

aaggacacgc cgctggagca ccacctctac gtggtcagct atgaggcggc cggcgagatc 1980 

gtacgcctca ccacgcccgg cttctcccat agctgctcca tgagccagaa cttcgacatg 2040 

ttcgtcagcc actacagcag cgtgagcacg ccgccctgcg tgcacgtcta caagctgagc 2100 

ggccccgacg acgaccccct gcacaagcag ccccgcttct gggctagcat gatggaggca 2160 

gccagctgcc ccccggatta tgttcctcca gagatcttcc atttccacac gcgctcggat 2220 

gtgcggetct acggcatgat ctacaagccc cacgccttgc agccagggaa gaagcacccc 2280 

accgtcctct ttgtatatgg aggcccccag gtgcagctgg tgaataactc cttcaaaggc 2340 

atcaagtact tgcggctcaa cacactggcc tccctgggct acgccgtggt tgtgattgac 2400 

ggeaggggct cctgtcagcg agggcttcgg ttcgaagggg ccctgaaaaa ccaaatgggc 2460 

caggtggaga tcgaggacca ggtggagggc ctgcagttcg tggccgagaa gtatggcttc 2520 

atcgacctga gccgagttgc catccatggc tggtcctacg ggggcttcct ctcgctcatg 2580 

gggctaatcc acaagcccca ggtgttcaag gtggccatcg cgggtgcccc ggtcaccgtc 2640 

tggatggcct acgacacagg gtacactgag cgctacatgg acgtccctga gaacaaccag 2700 

cacggctatg aggcgggttc cgtggccctg cacgtggaga agctgcccaa tgagcccaac 2760 

cgcttgctta tcctccacgg cttcctggac gaaaacgtgc actttttcca cacaaacttc 2820 

ctcgtctccc aactgatccg agcagggaaa ccttaccagc tccagatcta ccccaacgag 2880 

agacacagta ttcgctgccc cgagtcgggc gagcactatg aagtcacgtt gctgcacttt 2940 
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ctacaggaat acctctgagc ctgcccaccg 
cagcctccgc ggggaaccag gcgggaggga 
tttgtcccgc ccagcgctgg ccagccccga 
tttatccttt tttaaacgct cttgggtttt 
agagatggtg gtctcgggcc agcccctcct 
cgctgatggg cactggagag gccagaagag 
ggctccctgt gacctctcag tcccctggcc 
tgcaattgcc tgtccccccc ggccagcctc 
ggatattttt cataattatt taaaagacag 
cagcactttg ggaggctgag gcgggcggat 
ggccaacatg gggaaacccc gtctctacta 
cgcgtgccta taatcccagc tactcgggag 

tgcggtgagc caagatcgca 
gaaactctgt ctcaaaataa ataaaaaata 
agacttgggg tccacaccgg gcagcggggt 
ttcccaagcc cgactttcag gcaggcactg 
cagtggcggc tgtcccctcc ccagcccagc 
acacaccagg ggttccgggg ttgggagctg 
tctccccttc cctccccgcc aagagctctg 
aaaagaaaaa aaaaaaaaag aaacaaacca 
tgtcgattct tattcttti-a taattatgcy 
gttggaaaca tgtcacctg 


ggagccgcca catcacagca caagtggctg 
ctgagtggcc cgcgggcccc agtgaggcac 
ggagccgctg ccttcaccgc cccgacgcct 
atgtccgctg cttcttggtt gccgagacag 
ctccccgcct tctgggagga ggaggtcaca 
actcagagga gcgggctgcc ttccgcctgg 
cggccagcca ccgtccccag cacccaagca 
cccaacttga tgtttgtgtt ttgtttgggg 
gccgggcgcg gtggctcacg tctgtaatcc 
cacctgaggt tgggagttca agaccagcct 
aaaatacaaa aaattagccg ggtgtggtgg 
gctgaggcag gagaatcgct tgaacccggg 
ccattgcact ccagcctggg caacaagagc 
aaagacagaa agcaaggggt gcctaaatct 
tgcaacccag cacctggtag gctccatttc 
aaacgcaccg aacttccacg ctctgctggt 
cgcccagcca catgtgtctg cctgacccgt 
aaccatcccc acctcagggt tatatttccc 
ccaggggcgg gcaaaaaaaa aagtaaaaag 
cctctacata ttatggaaag aaaatatttt 
Cggaagaagt agacacau.ua aacgauucca 


<210> 29 

<211> 832 

<212> PRT 

<213> Homo sapiens 

<4 00> 29 


Met 

Arg 

Lys 

Val 

Lys 

to 

*4 

Leu 

Arg 

Leu 

Asp 

Lys 

Glu 

Asn 

Thr 

Gly 

Ser 

1 



5 





10 





15 


Trp 

Arg 

Ser 

phe 

Ser 

Leu 

Asn 

Ser 

Glu 

Gly 

Ala 

Glu 

Arg 

Met 

Ala 

Thr 


20 





25 





30 



Thr 

Gly 

Thr 

Pro 

Thr 

Ala 

Asp 

Arg 

Gly 

Asp 

Ala 

Ala 

Ala 

Thr 

Asp 

Asp 


35 





40 





45 




Pro 

Ala 

Ala 

Arg 

Phe 

Gin 

Val 

Gin 

Lys 

His 

Ser 

Trp 

Asp 

Gly 

Leu 

Arg 


50 





55 





60 





Ser 

lie 

lie 

His 

Gly 

Ser 

Arg 

Lys 

Tyr 

Ser 

Gly 

Leu 

lie 

Val 

Asn 

Lys 

65 





70 





75 





80 

Ala 

Pro 

His 

Asp 

Phe 

Gin 

Phe 

Val 

Gin 

Lys 

Thr 

Asp 

Glu 

Ser 

Gly 

Pro 





85 





90 





95 


His 

Ser 

His 

Arg 

Leu 

Tyr 

Tyr 

Leu 

Gly 

Met 

Pro 

Tyr 

Gly 

Ser 

Arg 

Glu 




100 





105 





110 



Asn 

Ser 

Leu 

Leu 

Tyr 

Ser 

Glu 

lie 

Pro 

Lys 

Lys 

Val 

Arg 

Lys 

Glu 

Ala 



115 





120 





125 




Leu 

Leu 

Leu 

Leu 

Ser 

Trp 

Lys 

Gin 

Met 

Leu 

Asp 

His 

Phe 

Gin 

Ala 

Thr 


130 





135 





140 





Pro 

His 

His 

Gly 

val 

Tyr 

Ser 

Arg 

Glu 

Glu 

Glu 

Leu 

Leu 

Arg 

Glu 

Arg 

145 





150 





155 





160 

Lys 

Arg 

Leu 

Gly 

Val 

Phe 

Gly 

lie 

Thr 

Ser 

Tyr 

Asp 

Phe 

His 

Ser 

Glu 



165 





170 





175 


Ser 

Gly 

Leu 

Phe 

Leu 

Phe 

Gin 

Ala 

Ser 

Asn 

Ser 

Leu 

Phe 

His 

Cys 

Arg 



180 





185 





190 



Asp 

Gly 

Gly 

Lys 

Asn 

Gly 

Phe 

Met 

Val 

Ser 

Pro 

Met 

Lys 

Pro 

Leu 

Glu 

195 





200 





205 




He 

Lys 

Thr 

Gin 

Cys 

Ser 

Gly 

Pro 

Arg 

Met 

Asp 

Pro 

Lys 

lie 

Cys 

Pro 


210 





215 





220 





Ala 

Asp 

Pro 

Ala 

Phe 

Phe 

Ser 

Phe 

lie 

Asn 

Asn 

Ser 

Asp 

Leu 

Trp 

Val 

225 




230 





235 





240 

Ala 

Asn 

lie 

Glu 

Thr 

Gly 

Glu 

Glu 

Arg 

Arg 

Leu 

Thr 

Phe 

Cys 

His 

Gin 


3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4 A A A 

4219 
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245 

Gly Leu Ser Asn Val Leu Asp Asp 

260 

Phe Val lie Gin Glu Glu Phe Asp 
275 280 

Pro Thr Ala Ser Trp Glu Gly Ser 
290 295 

Leu Tyr Glu Glu Val Asp Glu Ser 
305 310 

Ser Pro Ala Leu Glu Glu Arg Lys 

325 

Thr Gly Ser Lys Asn Pro Lys lie 

340 

Thr Asp Ser Gin Gly Lys lie Val 
355 360 

Gin Pro Phe Ser Ser Leu Phe Pro 
370 375 

Gly Trp Thr Arg Asp Gly Lys Tyr 
385 390 

Pro Gin Gin Trp Leu Gin Leu val 

405 

Pro Ser Thr Glu Asn Glu Glu Gin 

420 

Pro Arg Asn Val Gin Pro Tyr Val 
435 440 

Trp lie Asn Val His Asp He Phe 
450 455 

Glu Asp Glu Leu Cys Phe Leu Arg 
465 470 

Cys His Leu Tyr Lys Val Thr Ala 

485 

Trp Ser Glu Pro Phe Ser Pro Gly 

500 

Lys Glu Glu lie Ala Leu Thr Ser 
515 520 

His Gly Ser Lys He Trp Val Asn 
530 535 

Gin Gly Thr Lys Asp Thr Pro Leu 
545 550 

Tyr Glu Ala Ala Gly Glu lie Val 

565 

His Ser Cys Ser Met Ser Gin Asn 

580 

Ser Ser Val Ser Thr Pro Pro Cys 
595 600 

Pro Asp Asp Asp Pro Leu His Lys 
610 615 

Met Glu Ala Ala Ser Cys Pro Pro 
625 630 

His Phe His Thr Arg Ser Asp Val 

645 

Pro His Ala Leu Gin Pro Gly Lys 

660 

Tyr Gly Gly Pro Gin Val Gin Leu 
675 680 

Lys Tyr Leu Arg Leu Asn Thr Leu 
690 695 

Val lie Asp Gly Arg Gly Ser Cys 
705 710 

Ala Leu Lys Asn Gin Met Gly Gin 


250 255 

Pro Lys Ser Ala Gly Val Ala Thr 
265 270 

Arg Phe Thr Gly Tyr Trp Trp Cys 

285 

Glu Gly Leu Lys Thr Leu Arg lie 

300 

Glu Val Glu Val lie His Val Pro 
315 320 

Thr Asp Ser Tyr Arg Tyr Pro Arg 
330 335 

Ala Leu Lys Leu Ala Glu Phe Gin 
345 350 

Ser Thr Gin Glu Lys Glu Leu Val 

365 

Lys Val Glu Tyr lie Ala Arg Ala 

380 

Ala Trp Ala Met Phe Leu Asp Arg 
395 400 

Leu Leu Pro Pro Ala Leu Phe lie 
410 415 

Arg Leu Ala Ser Ala Arg Ala Val 
425 430 

Val Tyr Glu Glu Val Thr Asn Val 

445 

Tyr Pro Phe Pro Gin Ser Glu Gly 

460 

Ala Asn Glu Cys Lys Thr Gly Phe 
475 480 

Val Leu Lys Ser Gin Gly Tyr Asp 
490 495 

Glu Asp Glu Phe Lys Cys Pro lie 
505 510 

Gly Glu Trp Glu Val Leu Ala Arg 

525 

Glu Glu Thr Lys Leu Val Tyr Phe 

540 

Glu His His Leu Tyr Val Val Ser 
555 560 

Arg Leu Thr Thr Pro Gly Phe Ser 
570 575 

Phe Asp Met Phe Val Ser His Tyr 
585 590 

Val His Val Tyr Lys Leu Ser Gly 

605 

Gin Pro Arg Phe Trp Ala Ser Met 

620 

Asp Tyr Val Pro Pro Glu lie Phe 
635 640 

Arg Leu Tyr Gly Met lie Tyr Lys 
650 655 

Lys His Pro Thr Val Leu Phe Val 
665 670 

Val Asn Asn Ser Phe Lys Gly lie 

685 

Ala Ser Leu Gly Tyr Ala Val Val 

700 

Gin Arg Gly Leu Arg Phe Glu Gly 
715 720 

Val Glu lie Glu Asp Gin Val Glu 
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725 





730 





735 


Gly 

Leu 

Gin 

Phe 

Val 

Ala 

Glu 

Lys 

Tyr 

Gly 

Phe 

lie 

Asp 

Leu 

Ser 

Arg 




740 





745 





750 



val 

Ala 

He 

His 

Gly 

Trp 

Ser 

Tyr 

Gly 

Gly 

Phe 

Leu 

Ser 

Leu 

Met 

Gly 



755 





760 





765 




Leu 

lie 

His 

Lys 

Pro 

Gin 

val 

Phe 

Lys 

Ala 

Gin 

Pro 

Leu 

Ala 

Tyr 

Pro 


770 





775 





780 





Pro 

Arg 

Leu 

Pro 

Gly 

Arg 

Lys 

Arg 

Ala 

Leu 

Phe 

Pro 

His 

Lys 

Leu 

Pro 

785 





790 





795 





800 

Arg 

Leu 

Pro 

Thr 

Asp 

Pro 

Ser 

Arg 

Glu 

Thr 

Leu 

Pro 

Ala 

Pro 

Asp 

Leu 





805 





810 





815 


Pro 

Gin 

Arg 

Glu 

Thr 

Gin 

Tyr 

Ser 

Leu 

Pro 

Arg 

Val 

Gly 

Arg 

Ala 

Leu 




820 





825 





830 




<210> 30 

<211> 4159 

<212> DNA 

<213> Homo sapiens 

<400> 30 

r'annnnanra r'rtaaat.r.ar t’.caaef.fcera aatoaaaaat ocntaaapca ar.aaatr.r.nf’. fifl 
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tgtgtccgcc gcggctgtcg tcccccgctc ccgccacttc cggggtcgca gtcccgggca 120 

tggagccgcg accgtgaggc gccgctggac ccgggacgac ctgcccagtc cggccgccgc 180 

cccacgtccc ggtctgtgtc ccacgcctgc agctggaatg gaggctctct ggacccttta 240 

gaaggcaccc ctgccctcct gaggtcagct gagcggttaa tgcggaaggt taagaaactg 300 

cgcctggaca aggagaacac cggaagttgg agaagcttct cgctgaattc cgagggggct 360 

gagaggatgg ccaccaccgg gaccccaacg gccgaccgag gcgacgcagc cgccacagat 420 

gacccggccg cccgcttcca ggtgcagaag cactcgtggg acgggctccg gagcatcatc 480 

cacggcagcc gcaagtactc gggcctcatt gtcaacaagg cgccccacga cttccagttt 540 

gtgcagaaga cggatgagtc tgggccccac tcccaccgcc tctactacct gggaatgcca 600 

tatggcagcc gagagaactc cctcctctac tctgagattc ccaagaaggt ccggaaagag 660 

gctctgctgc tcctgtcctg gaagcagatg ctggatcatt tccaggccac gccccaccat 720 

ggggtctact ctcgggagga ggagctgctg agggagcgga aacgcctggg ggtcttcggc 780 

atcacctcct acgacttcca cagcgagagt ggcctcttcc tcttccaggc cagcaacagc 840 

ctcttccact gccgcgacgg cggcaagaac ggcttcatgg tgtcccctat gaaaccgctg 900 

gaaatcaaga cccagtgctc agggccccgg atggacccca aaatctgccc tgccgaccct 960 

gccttcttct ccttcatcaa taacagcgac ctgtgggtgg ccaacatcga gacaggcgag 1020 

gagcggcggc tgaccttctg ccaccaaggt ttatccaatg tcctggatga ccccaagtct 1080 

gcgggtgtgg ccaccttcgt catacaggaa gagttcgacc gcttcactgg gtactggtgg 1140 

tgccccacag cctcctggga aggttcagag ggcctcaaga cgctgcgaat cctgtatgag 1200 

gaagtcgatg agtccgaggt ggaggtcatt cacgtcccct ctcctgcgct agaagaaagg 1260 

aagacggact cgtatcggta ccccaggaca ggcagcaaga atcccaagat tgccttgaaa 1320 

ctggctgagt tccagactga cagccagggc aagatcgtct cgacccagga gaaggagctg 1380 

gtgcagccct tcagctcgct gttcccgaag gtggagtaca tcgccagggc cgggtggacc 1440 

cgggatggca aatacgcctg ggccatgttc ctggaccggc cccagcagtg gctccagctc 1500 

gtcctcctcc ccccggccct gttcatcccg agcacagaga atgaggagca gcggctagcc 1560 

tctgccagag ctgtccccag gaatgtccag ccgtatgtgg tgtacgagga ggtcaccaac 1620 

gtctggatca atgttcatga catcttctat cccttccccc aatcagaggg agaggacgag 1680 

ctctgctttc tccgcgccaa tgaatgcaag accggcttct gccatttgta caaagtcacc 1740 

gccgttttaa aatcccaggg ctacgattgg agtgagccct tcagccccgg ggaagatgaa 1800 

tttaagtgcc ccattaagga agagattgct ctgaccagcg gtgaatggga ggttttggcg 1860 

aggcacggct ccaagatctg ggtcaatgag gagaccaagc tggtgtactt ccagggcacc 1920 

aaggacacgc cgctggagca ccacctctac gtggtcagct atgaggcggc cggcgagatc 1980 

gtacgcctca ccacgcccgg cttctcccat agctgctcca tgagccagaa cttcgacatg 2040 

ttcgtcagcc actacagcag cgtgagcacg ccgccctgcg tgcacgtcta caagctgagc 2100 

ggccccgacg acgaccccct gcacaagcag ccccgcttct gggctagcat gatggaggca 2160 

gccagctgcc ccccggatta tgttcctcca gagatcttcc atttccacac gcgctcggat 2220 

gtgcggctct acggcatgat ctacaagccc cacgccttgc agccagggaa gaagcacccc 2280 

accgtcctct ttgtatatgg aggcccccag gtgcagctgg tgaataactc cttcaaaggc 2340 

atcaagtact tgcggctcaa cacactggcc tccctgggct acgccgtggt tgtgattgac 2400 
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ggcaggggct cctgtcagcg agggcttcgg ttcgaagggg ccctgaaaaa ccaaatgggc 
caggtggaga tcgaggacca ggtggagggc ctgcagttcg tggccgagaa gtatggcttc 
atcgacctga gccgagttgc catccatggc tggtcctacg ggggcttcct ctcgctcatg 
gggctaatcc acaagcccca ggtgttcaag gcccaaccgc ttgcttatcc tccacggctt 
cctggacgaa aacgtgcact ttttccacac aaacttcctc gtctcccaac tgatccgagc 
agggaaacct taccagctcc agatctaccc caacgagaga cacagtattc gctgccccga 
gtcgggcgag cactatgaag tcacgttgct gcactttcta caggaatacc tctgagcctg 
cccaccggga gccgccacat cacagcacaa gtggctgcag cctccgcggg gaaccaggcg 
ggagggactg agtggcccgc gggccccagt gaggcacttt gtcccgccca gcgctggcca 
gccccgagga gccgctgcct tcaccgcccc gacgcctttt atcctttttt aaacgctctt 
gggttttatg tccgctgctt cttggttgcc gagacagaga gatggtggtc tcgggccagc 
ccctcctctc cccgccttct gggaggagga ggtcacacgc tgatgggcac tggagaggcc 
agaagagact cagaggagcg ggctgccttc cgcctggggc tccctgtgac ctctcagtcc 
cctggcccgg ccagccaccg tccccagcac ccaagcatgc aattgcctgt cccccccggc 
cagcctcccc aacttgatgt ttgtgttttg tttgggggga tatttttcat aattatttaa 
aagacaggcc gggcgcggtg gctcacgtct gtaatcccag cactttggga ggctgaggcg 
ggcggatcac ctgaggttgg gagttcaaga ccagcctggc caacatgggg aaaccccgtc 
tctactaaaa atacaaaaaa ttagccgggt gtggtggcgc gtgcctataa tcccagctac 
tcgggaggct gaggcaggag aatcgcttga acccgggagg tggaggttgc ggtgagccaa 
gatcgcacca ttgcactcca gcctgggcaa caagagcgaa actctgtctc aaaataaata 
aaaaacaaaa gacagaaagc aaggggtgcc taaacctaga cttggggtcc acaccgggca 
gcggggttgc aacccagcac ctggtaggct ccatttcttc ccaagcccga gcagagggtc 
atgcgggccc cacaggagaa gcggccaggg cccgcggggg gcaccacctg tggacagccc 
tcctgtcccc aagctttcag gcaggcactg aaacgcaccg aacttccacg ctctgctggt 
cagtggcggc tgtcccctcc ccagcccagc cgcccagcca catgtgtctg cctgacccgt 
acacaccagg ggttccgggg ttgggagctg aaccatcccc acctcagggt tatatttccc 
tctccccttc cctccccgcc aagagctctg ccaggggcgg gcaaaaaaaa aagtaaaaag 
aaaagaaaaa aaaaaaaaag aaacaaacca cctctacata ttatggaaag aaaatatttt 
tgtcgattct tattctttta taattatgcg tggaagaagt agacacatta aacgattcca 
gttggaaaca tgtcacctg 

<210> 31 

<211> 832 

<212> PRT 

<213 > Homo sapiens 

<400> 31 


Met 

1 

Arg 

Lys 

Val 

Lys 

5 

Lys 

Leu 

Arg 

Leu 

Asp 

10 

Lys Glu 

Asn 

Thr 

Gly 

15 

Ser 

Trp 

Arg 

Ser 

Phe 

20 

Ser 

Leu 

Asn 

Ser 

Glu 

25 

Gly 

Ala Glu 

Arg 

Met 

30 

Ala 

Thr 

Thr 

Gly Thr 
35 

Pro 

Thr 

Ala 

Asp 

Arg 

40 

Gly 

Asp 

Ala Ala 

Ala 

45 

Thr 

Asp 

Asp 

Pro 

Ala 

50 

Ala 

Arg 

Phe 

Gin 

Val 

55 

Gin 

Lys 

His 

Ser Trp 
60 

Asp 

Gly 

Leu 

Arg 

Ser 

65 

He 

lie 

His 

Gly 

Ser 

70 

Arg 

Lys 

Tyr 

i 

Ser 

Gly Leu 
75 

lie 

Val 

Asn 

Lys 

80 

Ala 

Pro 

His 

Asp 

Phe 

85 

Gin 

Phe 

Val 

Gin 

Lys 

90 

Thr Asp 

Glu 

Ser 

Gly 

95 

Pro 

His 

Ser 

His 

Arg 

100 

Leu 

Tyr 

Tyr 

Leu 

Gly 

105 

Met 

Pro Tyr 

Gly 

Ser 

110 

Arg 

Glu 

Asn 

Ser 

Leu 

115 

Leu 

Tyr 

Ser 

Glu 

lie 
12 0 

Pro 

Lys 

Lys Val 

Arg 

125 

Lys 

Glu 

Ala 

Leu 

Leu 

130 

Leu 

Leu 

Ser 

Trp 

Lys 

135 

Gin 

Met 

Leu 

Asp His 
14 0 

Phe 

Gin 

Ala 

Thr 

Pro 

145 

His 

His 

Gly 

Val 

Tyr 

150 

Ser 

Arg 

Glu 

Glu 

Glu Leu 
155 

Leu 

Arg 

Glu 

Arg 

160 

hys 

Arg Leu 

Gly 

Val 

165 

Phe 

Gly 

lie 

Thr 

Ser 

170 

Tyr Asp 

Phe 

His 

Ser 

175 

Glu 

Ser 

Gly Leu 

Phe 

Leu 

Phe 

Gin 

Ala 

Ser 

Asn 

Ser Leu 

Phe 

His 

Cys 

Arg 


2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4159 
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Asp 

Gly 

Gly 

180 

Lys 

Asn 

Gly 

Phe 

Met 

He 

Lys 

195 

Thr 

Gin 

Cys 

Ser 

Gly 

200 

Pro 

Ala 

210 

Asp 

Pro 

Ala 

Phe 

Phe 

215 

Ser 

Phe 

225 

Ala 

Asn 

lie 

Glu 

Thr 

230 

Gly 

Glu 

Glu 

Gly 

Leu 

Ser 

Asn 

245 

Val 

Leu 

Asp 

Asp 

Phe 

Val 

lie 

260 

Gin 

Glu 

Glu 

Phe 

Asp 

Pro 

Thr 

275 

Ala 

Ser 

Trp 

Glu 

Gly 

280 

Ser 

Leu 

290 

Tyr 

GlU 

Glu 

Val 

Asp 

295 

Glu 

Ser 

305 

Ser 

Pro 

Ala 

Leu 

Glu 

310 

GlU 

Arg 

Lys 

Thr 

Gly 

Ser 

Lys 

325 

Asn 

Pro 

Lys 

lie 

Thr 

Asp 

Ser 

340 

Gin 

Gly 

Lys 

lie 

val 

Gin 

Pro 

355 

Phe 

Ser 

Ser 

Leu 

Phe 

360 

Pro 

Gly 

370 

Trp 

Thr 

Arg 

Asp 

Gly 

375 

Lys 

Tyr 

385 

Pro 

Gin 

Gin 

Trp 

Leu 

390 

Gin 

Leu 

Val 

Pro 

Ser 

Thr 

Glu 

405 

Asn 

Glu 

Glu 

Gin 

Pro 

Arg 

Asn 

420 

Val 

Gin 

Pro 

Tyr 

Val 

Trp 

lie 

435 

Asn 

Val 

His 

Asp 

lie 

440 

Phe 

Glu 

450 

Asp 

Glu 

Leu 

Cys 

Phe 

455 

Leu 

Arg 

465 

Cys 

His 

Leu 

Tyr 

Lys 

470 

Val 

Thr 

Ala 

Trp 

Ser 

Glu 

Pro 

485 

Phe 

Ser 

Pro 

Gly 

Lys 

Glu 

Glu 

500 

lie 

Ala 

Leu 

Thr 

Ser 

His 

Gly 

515 

Ser 

Lys 

lie 

Trp 

Val 

520 

Asn 

Gin 

530 

Gly 

Thr 

Lys 

Asp 

Thr 

535 

Pro 

Leu 

545 

Tyr 

GlU 

Ala 

Ala 

Gly 

550 

Glu 

lie 

val 

His 

Ser 

Cys 

Ser 

565 

Met 

Ser 

Gin 

Asn 

Ser 

Ser 

Val 

580 

Ser 

Thr 

Pro 

Pro 

Cys 

Pro 

Asp 

595 

Asp 

Asp 

Pro 

Leu 

His 

600 

Lys 

Met 

610 

Glu 

Ala 

Ala 

Ser 

Cys 

615 

Pro 

Pro 

625 

His 

Phe 

His 

Thr 

Arg 

630 

Ser 

Asp 

val 

Pro 

His 

Ala 

Leu 

645 

Gin 

Pro 

Gly 

Lys 


185 

val 

Ser 

Pro 

Met 

Lys 

190 

Pro 

Leu 

Glu 

Arg 

Met 

Asp 

Pro 

205 

Lys 

lie 

Cys 

Pro 

lie 

Asn 

Asn 

220 

Ser 

Asp 

Leu 

Trp 

Val 

Arg 

Arg 

235 

Leu 

Thr 

Phe 

Cys 

His 

240 

Gin 

Pro 

250 

Lys 

Ser 

Ala 

Gly 

Val 

255 

Ala 

Thr 

265 

Arg 

Phe 

Thr 

Gly 

Tyr 

270 

Trp 

Trp 

Cys 

Glu 

Gly 

Leu 

Lys 

285 

Thr 

Leu 

Arg 

lie 

Glu 

Val 

GlU 

300 

Val 

lie 

His 

Val 

Pro 

Thr 

Asp 

315 

Ser 

Tyr 

Arg 

Tyr 

Pro 

320 

Arg 

Ala 

330 

Leu 

Lys 

Leu 

Ala 

Glu 

335 

Phe 

Gin 

345 

Ser 

Thr 

Gin 

Glu 

Lys 

350 

Glu 

Leu 

Val 

Lys 

Val 

Glu 

Tyr 

365 

lie 

Ala 

Arg 

Ala 

Ala 

Trp 

Ala 

380 

Met 

Phe 

Leu 

Asp 

Arg 

Leu 

Leu 

395 

Pro 

Pro 

Ala 

Leu 

Phe 

400 

lie 

Arg 

410 

Leu 

Ala 

Ser 

Ala 

Arg 

415 

Ala 

Val 

425 

Val 

Tyr 

Glu 

Glu 

Val 

430 

Thr 

Asn 

Val 

Tyr 

Pro 

Phe 

Pro 

445 

Gin 

Ser 

Glu 

Gly 

Ala 

Asn 

Glu 

460 

Cys 

Lys 

Thr 

Gly 

Phe 

Val 

Leu 

475 

Lys 

Ser 

Gin 

Gly 

Tyr 

480 

Asp 

Glu 

490 

Asp 

Glu 

Phe 

Lys 

Cys 

495 

Pro 

lie 

505 

Gly 

Glu 

Trp 

Glu 

Val 

510 

Leu 

Ala 

Arg 

Glu 

Glu 

Thr 

Lys 

525 

Leu 

Val 

Tyr 

Phe 

Glu 

His 

His 

540 

Leu 

Tyr 

Val 

Val 

Ser 

Arg 

Leu 

555 

Thr 

Thr 

Pro 

Gly 

Phe 

560 

Ser 

Pbe 

570 

Asp 

Met 

Phe 

Val 

Ser 

575 

His 

Tyr 

585 

Val 

His 

Val 

Tyr 

Lys 

590 

Leu 

Ser 

Gly 

Gin 

Pro 

Arg 

Phe 

605 

Trp 

Ala 

Ser 

Met 

Asp 

Tyr 

val 

620 

Pro 

Pro 

Glu 

lie 

Phe 

Arg 

Leu 

635 

Tyr 

Gly 

Met 

lie 

Tyr 

640 

Lys 

Lys 

650 

His 

Pro 

Thr 

Val 

Leu 

655 

Phe 

Val 
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* 





660 





665 





670 



Tyx 

Gly 

Gly 

Pro 

Gin 

val 

Gin 

Leu 

val 

Asn 

Asn 

Ser 

Phe 

Lys 

Gly 

lie 



675 





680 





685 




Lys 

Tyr 

Leu 

Arg 

Leu 

Asn 

Thr 

Leu 

Ala 

Ser 

Leu Gly 

Tyr 

Ala 

Val 

Val 


690 





695 





700 





Val 

He 

Asp 

Gly Arg 

Gly 

Ser 

Cys 

Gin 

Arg 

Gly Leu 

Arg 

Phe 

Glu 

Gly 

705 





710 





715 





720 

Ala 

Leu 

Lys 

Asn 

Gin 

Met 

Gly 

Gin 

Val 

Glu 

lie 

Glu 

Asp 

Gin 

Val 

Glu 





725 





730 





735 


Gly 

Leu 

Gin 

Phe 

Val 

Ala 

Glu 

Lys 

Tyr 

Gly 

Phe 

lie 

Asp 

Leu 

Ser 

Arg 




740 





745 





750 



Val 

Ala 

lie 

His 

Gly 

Trp 

Ser 

Tyr 

Gly 

Gly 

Phe 

Leu 

Ser 

Leu 

Met 

Gly 



755 





760 





765 



Leu 

lie 

His 

Lys 

Pro 

Gin 

val 

Phe 

Lys 

Ala 

Gin 

Pro 

Leu 

Ala 

Tyr 

Pro 


770 





775 





780 





Pro 

Arg 

Leu 

Pro 

Gly 

Arg 

Lys 

Arg 

Ala 

Leu 

Phe 

Pro 

His 

Lys 

Leu 

Pro 

785 





790 





795 





800 

Arg 

Leu 

Pro 

Thr 

Asp 

Pro 

Ser 

Arg 

Glu 

Thr 

Leu 

Pro 

Ala 

Pro 

Asp 

Leu 





805 





810 





815 


Pro 

Gin 

Arg 

Glu 

Thr 

Gin 

Tyr 

Ser 

Leu 

Pro 

Arg Val 

Gly 

Arg 

Ala 

Leu 




820 





825 





830 




<210> 32 

<211> 4076 

<212 > DMA 

<213 > Homo sapiens 

<400> 32 


caggccgccg cctgggtcgc tcaacttccg ggtcaaaggt gcctgagccg gcgggtcccc 60 
tgtgtccgcc gcggctgtcg tcccccgctc ccgccacttc cggggtcgca gtcccgggca 120 
tggagccgcg accgtgaggc gccgctggac ccgggacgac ctgcccagtc cggccgccgc 180 
cccacgtccc ggtctgtgtc ccacgcctgc agctggaatg gaggctctct ggacccttta 240 
gaaggcaccc ctgccctcct gaggtcagct gagcggttaa tgcggaaggt taagaaactg 300 
cgcctggaca aggagaacac cggaagttgg agaagcttct cgctgaattc cgagggggct 360 
gagaggatgg ccaccaccgg gaccccaacg gccgaccgag gcgacgcagc cgccacagat 420 
gacccggccg cccgcttcca ggtgcagaag cactcgtggg acgggctccg gagcatcatc 480 
cacggcagcc gcaagtactc gggcctcatt gtcaacaagg cgccccacga cttccagttt 540 
gtgcagaaga cggatgagtc tgggccccac tcccaccgcc tctactacct gggaatgcca 600 
tatggcagcc gagagaactc cctcctctac tctgagattc ccaagaaggt ccggaaagag 660 
gctctgctgc tcctgtcctg gaagcagatg ctggatcatt tccaggccac gccccaccat 720 
ggggtctact ctcgggagga ggagctgctg agggagcgga aacgcctggg ggtcttcggc 780 
atcacctcct acgacttcca cagcgagagt ggcctcttcc tcttccaggc cagcaacagc 840 
ctcttccact gccgcgacgg cggcaagaac ggcttcatgg tgtcccctat gaaaccgctg 900 
gaaatcaaga cccagtgctc agggccccgg atggacccca aaatctgccc tgccgaccct 960 
gccttcttct ccttcatcaa taacagcgac ctgtgggtgg ccaacatcga gacaggcgag 1020 
gagcggcggc tgaccttctg ccaccaaggt ttatccaatg tcctggatga ccccaagtct 1080 
gcgggtgtgg ccaccttcgt catacaggaa gagttcgacc gcttcactgg gtactggtgg 1140 
tgccccacag cctcctggga aggttcagag ggcctcaaga cgctgcgaat cctgtatgag 1200 
gaagtcgatg agtccgaggt ggaggtcatt cacgtcccct ctcctgcgct agaagaaagg 1260 
aagacggact cgtatcggta ccccaggaca ggcagcaaga atcccaagat tgccttgaaa 1320 
ctggctgagt tccagactga cagccagggc aagatcgtct cgacccagga gaaggagctg 1380 
gtgcagccct tcagctcgct gttcccgaag gtggagtaca tcgccagggc cgggtggacc 1440 
cgggatggca aatacgcctg ggccatgttc ctggaccggc cccagcagtg gctccagctc 1500 
gtcctcctcc ccccggccct gttcatcccg agcacagaga atgaggagca gcggctagcc 1560 
tctgccagag ctgtccccag gaatgtccag ccgtatgtgg tgtacgagga ggtcaccaac 1620 
gtctggatca atgttcatga catcttctat cccttccccc aatcagaggg agaggacgag 1680 
ctctgctttc tccgcgccaa tgaatgcaag accggcttct gccatttgta caaagtcacc 1740 
gccgttttaa aatcccaggg ctacgattgg agtgagccct tcagccccgg ggaagatgaa 1800 
tttaagtgcc ccattaagga agagattgct ctgaccagcg gtgaatggga ggttttggcg 1860 
aggcacggct ccaagatctg ggtcaatgag gagaccaagc tggtgtactt ccagggcacc 1920 
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aaggacacgc cgctggagca ccacctctac gtggtcagct atgaggcggc cggcgagatc 1980 

gtacgcctca ccacgcccgg cttctcccat agctgctcca tgagccagaa cttcgacatg 2040 

ttcgtcagcc actacagcag cgtgagcacg ccgccctgcg tgcacgtcta caagctgagc 2100 

ggccccgacg acgaccccct gcacaagcag ccccgcttct gggctagcat gatggaggca 2160 

gccagctgcc ccccggatta tgttcctcca gagatcttcc atttccacac gcgctcggat 2220 

gtgcggctct acggcatgat ctacaagccc cacgccttgc agccagggaa gaagcacccc 2280 

accgtcctct ttgtatatgg aggcccccag gtgcagctgg tgaataactc cttcaaaggc 2340 

atcaagtact tgcggctcaa cacactggcc tccctgggct acgccgtggt tgtgattgac 2400 

ggcaggggct cctgtcagcg agggcttcgg ttcgaagggg ccctgaaaaa ccaaatgggc 2460 

caggtggaga tcgaggacca ggtggagggc ctgcagttcg tggccgagaa gtatggcttc 2520 

atcgacctga gccgagttgc catccatggc tggtcctacg ggggcttcct ctcgctcatg 2580 

gggctaatcc acaagcccca ggtgttcaag gcccaaccgc ttgcttatcc tccacggctt 2640 

cctggacgaa aacgtgcact ttttccacac aaacttcctc gtctcccaac tgatccgagc 2700 

agggaaacct taccagctcc agatctaccc caacgagaga cacagtattc gctgccccga 2760 

gtcgggcgag cactatgaag tcacgttgct gcactttcta caggaatacc tctgagcctg 2820 

cccaccggga gccgccacat cacagcacaa gtggctgcag cctccgcggg gaaccaggcg 2880 

ggagggactg agtggcccgc gggccccagt gaggcacttt gtcccgccca gcgctggcca 2940 

gccccgagga gccgctgcct tcaccgcccc gacgcctttt atcctttttt aaacgctctt 3000 

gggttttatg tccgctgctt cttggttgcc gagacagaga gatggtggtc tcgggccagc 3060 

ccctcctctc cccgccttct gggaggagga ggtcacacgc tgatgggcac tggagaggcc 3120 

agaagagact cagaggagcg ggctgccttc cgcctggggc tccctgtgac ctctcagtcc 3180 

cctggcccgg ccagccaccg tccccagcac ccaagcatgc aattgcctgt cccccccggc 3240 

cagcctcccc aacttgatgt ttgtgttttg tttgggggga tatttttcat aattatttaa 3300 

aagacaggcc gggcgcggtg gctcacgtct gtaatcccag cactttggga ggctgaggcg 3360 

ggcggatcac ctgaggttgg gagttcaaga ccagcctggc caacatgggg aaaccccgtc 3420 

tctactaaaa atacaaaaaa ttagccgggt gtggtggcgc gtgcctataa tcccagctac 3480 

tcgggaggct gaggcaggag aatcgcttga acccgggagg tggaggttgc ggtgagccaa 3540 

gatcgcacca ttgcactcca gcctgggcaa caagagcgaa actctgtctc aaaataaata 3600 

aaaaataaaa gacagaaagc aaggggtgcc taaatctaga cttggggtcc acaccgggca 3660 

gcggggttgc aacccagcac ctggtaggct ccatttcttc ccaagcccga ctttcaggca 3720 

ggcactgaaa cgcaccgaac ttccacgctc tgctggtcag tggcggctgt cccctcccca 3780 

gcccagccgc ccagccacat gtgtctgcct gacccgtaca caccaggggt tccggggttg 3840 

ggagctgaac catccccacc tcagggttat atttccctct ccccttccct ccccgccaag 3900 

agctctgcca ggggcgggca aaaaaaaaag taaaaagaaa agaaaaaaaa aaaaaagaaa 3960 

caaaccacct ctacatatta tggaaagaaa atatttttgt cgattcttat tcttttataa 4020 

ttatgcgtgg aagaagtaga cacattaaac gattccagtt ggaaacatgt cacctg 4076 

<210> 33 

<211> 879 

<212> PRT 

<213> Homo sapiens 

<400> 33 

Met Arg Lys Val Lys Lys Leu Arg Leu Asp Lys Glu Asn Thr Gly Ser 

1 5 10 15 

Trp Arg Ser Phe Ser Leu Asn Ser Glu Gly Ala Glu Arg Met Ala Thr 

20 25 30 

Thr Gly Thr Pro Thr Ala Asp Arg Gly Asp Ala Ala Ala Thr Asp Asp 
35 40 45 

Pro Ala Ala Arg Phe Gin Val Gin Lys His Ser Trp Asp Gly Leu Arg 
50 55 60 

Ser He lie His Gly Ser Arg Lys Tyr Ser Gly Leu lie Val Asn Lys 

65 70 75 80 

Ala Pro His Asp Phe Gin Phe Val Gin Lys Thr Asp Glu Ser Gly Pro 

85 90 95 

His Ser His Arg Leu Tyr Tyr Leu Gly Met Pro Tyr Gly Ser Arg Glu 

100 105 110 

Asn Ser Leu Leu Tyr Ser Glu lie Pro Lys Lys Val Arg Lys Glu Ala 
115 120 125 

Leu Leu Leu Leu Ser Trp Lys Gin Met Leu Asp His Phe Gin Ala Thr 
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130 135 

Pro His His Gly Val Tyr Ser Arg 
145 150 

Lys Arg Leu Gly Val Phe Gly lie 

165 

Ser Gly Leu Phe Leu Phe Gin Ala 

180 

Asp Gly Gly Lys Asn Gly Phe Met 
195 200 

lie Lys Thr Gin Cys Ser Gly Pro 
210 215 

Ala Asp Pro Ala Phe Phe Ser Phe 
225 230 

Ala Asn lie Glu Thr Gly Glu Glu 

245 

Gly Leu Ser Asn Val Leu Asp Asp 

260 

Phe Val lie Gin Glu Glu Phe Asp 
275 200 

Pro Thr Ala Ser Trp Glu Gly Ser 

n a a 

Leu Tyr Glu Glu Val Asp Glu Ser 

305 310 

Ser Pro Ala Leu Glu Glu Arg Lys 

325 

Thr Gly Ser Lys Asn Pro Lys He 

340 

Thr Asp Ser Gin Gly Lys lie Val 
355 360 

Gin Pro Phe Ser Ser Leu Phe Pro 
370 375 

Gly Trp Thr Arg Asp Gly Lys Tyr 
385 390 

Pro Gin Gin Trp Leu Gin Leu Val 

405 

Pro Ser Thr Glu Asn Glu Glu Gin 

420 

Pro Arg Asn Val Gin Pro Tyr Val 
435 440 

Trp lie Asn Val His Asp lie Phe 
450 455 

Glu Asp Glu Leu Cys Phe Leu Arg 
465 470 

Cys His Leu Tyr Lys Val Thr Ala 

485 

Trp Ser Glu Pro Phe Ser Pro Gly 

500 


140 

Glu Glu Glu Leu Leu Arg Glu Arg 
155 160 

Thr Ser Tyr Asp Phe His Ser Glu 
170 175 

Ser Asn Ser Leu Phe His Cys Arg 
185 190 

Val Ser Pro Met Lys Pro Leu Glu 

205 

Arg Met Asp Pro Lys lie Cys Pro 

220 

lie Asn Asn Ser Asp Leu Trp val 
235 240 

Arg Arg Leu Thr Phe Cys His Gin 
250 255 

Pro Lys Ser Ala Gly Val Ala Thr 
265 270 

Arg Phe Thr Gly Tyr Trp Trp Cys 

285 

Glu Gly Leu Lys Thr Leu Arg He 

i oo 

Glu Val Glu Val lie His Val Pro 
315 320 

Thr Asp Ser Tyr Arg Tyr Pro Arg 
330 335 

Ala Leu Lys Leu Ala Glu Phe Gin 
345 350 

Ser Thr Gin Glu Lys Glu Leu Val 

365 

Lys Val Glu Tyr lie Ala Arg Ala 

380 

Ala Trp Ala Met Phe Leu Asp Arg 
395 400 

Leu Leu Pro Pro Ala Leu Phe lie 
410 415 

Arg Leu Ala Ser Ala Arg Ala Val 
425 430 

Val Tyr Glu Glu Val Thr Asn Val 

445 

Tyr Pro Phe Pro Gin Ser Glu Gly 

460 

Ala Asn Glu Cys Lys Thr Gly Phe 
475 480 

Val Leu Lys Ser Gin Gly Tyr Asp 
490 495 

Glu Asp Glu Phe Lys Cys Pro lie 
505 510 


Lys Glu 

His Gly 
530 


Glu lie 
515 

Ser Lys 


Ala Leu 
Gly Thr 


Thr ser 
520 
Lys Asp 
535 


Gly Glu 
Thr Pro 


Trp Glu 

Leu Glu 
540 


Val Leu 
525 

His His 


Ala Arg 
Leu Tyr 


Val Val Ser Tyr Glu Ala Ala Gly Glu lie Val Arg Leu Thr Thr Pro 
545 550 555 560 
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Gly Phe Ser His 

Ser His Tyr Ser 

580 

Leu Ser Gly Pro 
595 

Ala Ser Met Met 
610 

Glu He Phe His 
625 

lie Tyr Lys Pro 

Leu Phe Val Tyr 

66 0 

Lys Gly lie Lys 
675 

Ala Val Val Val 
690 

Phe Glu Gly Ala 
70S 

Gin Val Glu Gly 

Leu Ser Arg Val 

740 


Ser Cys Ser Met 
565 

Ser Val Ser Thr 

Asp Asp Asp Pro 

600 

Glu Ala Ala Ser 
615 

Phe His Thr Arg 
630 

His Ala Leu Gin 
645 

Gly Gly Pro Gin 

Tyr Leu Arg Leu 

680 

lie Asp Gly Arg 
695 

Leu Lys Asn Gin 
710 

Leu Gin Phe Val 
725 

Ala lie His Gly 


Ser Gin Asn Phe 
570 

Pro Pro Cys Val 
585 

Leu His Lys Gin 

Cys Pro Pro Asp 

620 

Ser Asp Val Arg 
635 

Pro Gly Lys Lys 
650 

Val Gin Leu Val 
665 

Asn Thr Leu Ala 

Gly Ser Cys Gin 

700 

Met Gly Gin Val 
715 

Ala Glu Lys Tyr 
730 

Trp Ser Tyr Gly 
745 


Asp Met Phe Val 
575 

His Val Tyr Lys 
590 

Pro Arg Phe Trp 
605 

Tyr Val Pro Pro 

Leu Tyr Gly Met 

640 

His Pro Thr Val 
655 

Asn Asn Ser Phe 
670 

Ser Leu Gly Tyr 
685 

Arg Gly Leu Arg 
Glu lie Glu Asp 

nn r\ 

* ^ w 

Gly Phe lie Asp 
735 

Gly Phe Leu Ser 
750 


Leu Met 

Gly Ala 
770 


Gly Leu 
755 

Pro Val 


lie His 
Thr Val 


Lys Pro 
760 
Trp Met 
775 


Gin Val 
Ala Tyr 


Phe Lys 

Asp Thr 
780 


Val Ala 
765 

Gly Tyr 


lie Ala 
Thr Glu 


Arg Tyr Met Asp 
785 

Ser Val Ala Leu 

Leu lie Leu His 

820 

Asn Phe Leu Val 
835 

Gin lie Tyr Pro 
850 

Glu His Tyr Glu 
865 


Val Pro Glu Asn 
790 

His Val Glu Lys 
805 

Gly Phe Leu Asp 

Ser Gin Leu lie 

840 

Asn Glu Arg His 
855 

Val Thr Leu Leu 
870 


Asn Gin His Gly 
795 

Leu Pro Asn Glu 
810 

Glu Asn Val His 
825 

Arg Ala Gly Lys 

Ser lie Arg Cys 

860 

His Phe Leu Gin 
875 


Tyr Glu Ala Gly 

800 

Pro Asn Arg Leu 
815 

Phe Phe His Thr 
830 

Pro Tyr Gin Leu 
845 

Pro Glu Ser Gly 
Glu Tyr Leu 


<210> 34 

<211> 4263 

<212> DNA 

<213> Homo sapiens 

<400> 34 

caggccgccg cctgggtcgc tcaacttccg 
tgtgtccgcc gcggctgtcg tcccccgctc 
tggagccgcg accgtgaggc gccgctggac 
cccacgtccc ggtctgtgtc ccacgcctgc 
gaaggcaccc ctgccctcct gaggtcagct 
cgcctggaca aggagaacac cggaagttgg 
gagaggatgg ccaccaccgg gaccccaacg 
gacccggccg cccgcttcca ggtgcagaag 
cacggcagcc gcaagtactc gggcctcatt 
gtgcagaaga cggatgagtc tgggccccac 
tatggcagcc gagagaactc cctcctctac 
gctctgctgc tcctgtcctg gaagcagatg 
ggggtctact ctcgggagga ggagctgctg 


ggtcaaaggt gcctgagccg gcgggtcccc 
ccgccacttc cggggtcgca gtcccgggca 
ccgggacgac ctgcccagtc cggccgccgc 
agctggaatg gaggctctct ggacccttta 
gagcggttaa tgcggaaggt taagaaactg 
agaagcttct cgctgaattc cgagggggct 
gccgaccgag gcgacgcagc cgccacagat 
cactcgtggg acgggctccg gagcatcatc 
gtcaacaagg cgccccacga cttccagttt 
tcccaccgcc tctactacct gggaatgcca 
tctgagattc ccaagaaggt ccggaaagag 
ctggatcatt tccaggccac gccccaccat 
a 999 a 9 c 99 a aacgcctggg ggtcttcggc 


60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 
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atcacctcct acgacttcca cagcgagagt ggcctcttcc tcttccaggc cagcaacagc 840 
ctcttccact gccgcgacgg cggcaagaac ggcttcatgg tgtcccctat gaaaccgctg 900 
gaaatcaaga cccagtgctc agggccccgg atggacccca aaatctgccc tgccgaccct 960 
gccttcttct ccttcatcaa taacagcgac ctgtgggtgg ccaacatcga gacaggcgag 1020 
gagcggcggc tgaccttctg ccaccaaggt ttatccaatg tcctggatga ccccaagtct 1080 
gcgggtgtgg ccaccttcgt catacaggaa gagttcgacc gcttcactgg gtactggtgg 1140 
tgccccacag cctcctggga aggttcagag ggcctcaaga cgctgcgaat cctgtatgag 1200 
gaagtcgatg agtccgaggt ggaggtcatt cacgtcccct ctcctgcgct agaagaaagg 1260 
aagacggact cgtatcggta ccccaggaca ggcagcaaga atcccaagat tgccttgaaa 1320 
ctggctgagt tccagactga cagccagggc aagatcgtct cgacccagga gaaggagctg 1380 
gtgcagccct tcagctcgct gttcccgaag gtggagtaca tcgccagggc cgggtggacc 1440 
c 999 atggca aatacgcctg ggccatgttc ctggaccggc cccagcagtg gctccagctc 1500 
gtcctcctcc ccccggccct gttcatcccg agcacagaga atgaggagca gcggctagcc 1560 
tctgccagag ctgtccccag gaatgtccag ccgtatgtgg tgtacgagga ggtcaccaac 1620 
gtctggatca atgttcatga catcttctat cccttccccc aatcagaggg agaggacgag 1680 
ctctgctttc tccgcgccaa tgaatgcaag accggcttct gccatttgta caaagtcacc 1740 
gccgttttaa aatcccaggg ctacgattgg agtgagccct tcagccccgg ggaagatgaa 1800 
tttaagtgcc ccattaagga agagattgct ctgaccagcg gtgaatggga ggttttggcg 1860 
aggcacggct ccaagggcac caaggacacg ccgctggagc accacctcta cgtggtcagc 1920 
tatgaggcgg ccggcgagat cgtacgcctc accacgcccg gcttctccca tagctgctcc 1980 
atgagccaga acttcgacat gttcgtcagc cactacagca gcgtgagcac gccgccctgc 2040 
gtgcacgtct acaagctgag cggccccgac gacgaccccc tgcacaagca gccccgcttc 2100 
tgggctagca tgatggaggc agccagctgc cccccggatt atgttcctcc agagatcttc 2160 
catttccaca cgcgctcgga tgtgcggctc tacggcatga tctacaagcc ccacgccttg 2220 
cagccaggga agaagcaccc caccgtcctc tttgtatatg gaggccccca ggtgcagctg 2280 
gtgaataact ccttcaaagg catcaagtac ttgcggctca acacactggc ctccctgggc 2340 
tacgccgtgg ttgtgattga cggcaggggc tcctgtcagc gagggcttcg gttcgaaggg 2400 
gccctgaaaa accaaatggg ccaggtggag atcgaggacc aggtggaggg cctgcagttc 2460 
gtggccgaga agtatggctt catcgacctg agccgagttg ccatccatgg ctggtcctac 2520 
gggggcttcc tctcgctcat ggggctaatc cacaagcccc aggtgttcaa ggtggccatc 2580 
gcgggtgccc cggtcaccgt ctggatggcc tacgacacag ggtacactga gcgctacatg 2640 
gacgtccctg agaacaacca gcacggctat gaggcgggtt ccgtggccct gcacgtggag 2 700 
aagctgccca atgagcccaa ccgcttgctt atcctccacg gcttcctgga cgaaaacgtg 2760 
cactttttcc acacaaactt cctcgtctcc caactgatcc gagcagggaa accttaccag 2820 
ctccagatct accccaacga gagacacagt attcgctgcc ccgagtcggg cgagcactat 2880 
gaagtcacgt tgctgcactt tctacaggaa tacctctgag cctgcccacc gggagccgcc 2940 
acatcacagc acaagtggct gcagcctccg cggggaacca ggcgggaggg actgagtggc 3 00 0 
ccgcgggccc cagtgaggca ctttgtcccg cccagcgctg gccagccccg aggagccgct 3060 
gccttcaccg ccccgacgcc ttttatcctt ttttaaacgc tcttgggttt tatgtccgct 3120 
gcttcttggt tgccgagaca gagagatggt ggtctcgggc cagcccctcc tctccccgcc 3180 
ttctgggagg aggaggtcac acgctgatgg gcactggaga ggccagaaga gactcagagg 324 0 
agcgggctgc cttccgcctg gggctccctg tgacctctca gtcccctggc ccggccagcc 3300 
accgtcccca gcacccaagc atgcaattgc ctgtcccccc cggccagcct ccccaacttg 3360 
atgtttgtgt tttgtttggg gggatatttt tcataattat ttaaaagaca ggccgggcgc 3420 
ggtggctcac gtctgtaatc ccagcacttt gggaggctga ggcgggcgga tcacctgagg 3480 
ttgggagttc aagaccagcc tggccaacat ggggaaaccc cgtctctact aaaaatacaa 3540 
aaaattagcc gggtgtggtg gcgcgtgcct ataatcccag ctactcggga ggctgaggca 3600 
ggagaatcgc ttgaacccgg gaggtggagg ttgcggtgag ccaagatcgc accattgcac 3660 
tccagcctgg gcaacaagag cgaaactctg tctcaaaata aataaaaaat aaaagacaga 3720 
aagcaagggg tgcctaaatc tagacttggg gtccacaccg ggcagcgggg ttgcaaccca 3780 
gcacctggta ggctccattt cttcccaagc ccgagcagag ggtcatgcgg gccccacagg 3840 
agaagcggcc agggcccgcg gggggcacca cctgtggaca gccctcctgt ccccaagctt 3900 
tcaggcaggc actgaaacgc accgaacttc cacgctctgc tggtcagtgg cggctgtccc 3960 
ctccccagcc cagccgccca gccacatgtg tctgcctgac ccgtacacac caggggttcc 4020 
ggggttggga gctgaaccat ccccacctca gggttatatt tccctctccc cttccctccc 4080 
cgccaagagc tctgccaggg gcgggcaaaa aaaaaagtaa aaagaaaaga aaaaaaaaaa 4140 
aaagaaacaa accacctcta catattatgg aaagaaaata tttttgtcga ttcttattct 4200 
tttataatta tgcgtggaag aagtagacac attaaacgat tccagttgga aacatgtcac 4260 
ctg 4263 
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<210> 35 

<211> 879 

<212> PRT 

<213> Homo sapiens 

<400> 35 


Met 

Arg 

Lys 

Val 

Lys 

Lys 

Leu 

Arg 

Leu Asp 

Lys 

Glu 

Asn 

Thr 

Gly 

Ser 

l 




5 





10 





15 


Trp 

Arg 

Ser 

Phe 

Ser 

Leu 

Asn 

Ser 

Glu Gly 

Ala 

Glu 

Arg 

Met 

Ala 

Thr 




20 





25 





30 



Thr 

Gly 

Thr 

Pro 

Thr 

Ala 

Asp 

Arg 

Gly Asp 

Ala 

Ala 

Ala 

Thr 

Asp 

Asp 



35 





40 





45 




Pro 

Ala 

Ala 

Arg 

Phe 

Gin 

Val 

Gin 

Lys 

His 

Ser 

Trp 

Asp 

Gly 

Leu 

Arg 


50 





55 





60 





Ser 

He 

lie 

His 

Gly 

Ser 

Arg 

Lys 

Tyr 

Ser 

Gly 

Leu 

lie 

Val 

Asn 

Lys 

65 





70 





75 





80 

Ala 

Pro 

His 

Asp 

Phe 

Gin 

Phe 

Val 

Gin 

Lys 

Thr 

Asp 

Glu 

Ser 

Gly 

Pro 





85 





90 





95 


His 

Ser 

His 

Arg 

Leu 

Tyr 

Tyr 

Leu 

Gly Met 

Pro 

Tyr 

Gly 

Ser 

Arg 

Glu 




100 





105 





110 



Asn 

ser 

Leu 

Leu 

Tyr 

Ser 

Glu 

He 

Pro 

Lys 

Lys 

Val 

Arg 

Lys 

Glu 

Ala 



115 





120 



■ - 


125 




Leu 

Leu 

Leu 

Leu 

Ser 

Trp 

Lys 

Gin 

Met 

Leu 

Asp 

His 

Phe 

Gin 

Ala 

Thr 


130 





135 





140 





Pro 

His 

His 

Gly 

Val 

Tyr 

Ser 

Arg 

Glu 

Glu 

Glu 

Leu 

Leu 

Arg 

Glu 

Arg 

145 





150 





155 





160 

Lys 

Arg 

Leu 

Gly 

Val 

Phe 

Gly 

lie 

Thr 

Ser 

Tyr 

Asp 

Phe 

His 

Ser 

Glu 





165 





170 





175 


Ser 

Gly 

Leu 

Phe 

Leu 

Phe 

Gin 

Ala 

Ser 

Asn 

Ser 

Leu 

Phe 

His 

Cys 

Arg 




180 





185 





190 



Asp 

Gly 

Gly 

Lys 

Asn 

Gly 

Phe 

Met 

Val 

Ser 

Pro 

Met 

Lys 

Pro 

Leu 

Glu 



195 





200 





205 




lie 

Lys 

Thr 

Gin 

Cys 

Ser 

Gly 

Pro 

Arg 

Met 

Asp 

Pro 

Lys 

lie 

Cys 

Pro 


210 





215 





220 





Ala 

Asp 

Pro 

Ala 

Phe 

Phe 

Ser 

Phe 

lie 

Asn 

Asn 

Ser 

Asp 

Leu 

Trp 

val 

225 





230 





235 





240 

Ala 

Asn 

lie 

Glu 

Thr 

Gly 

Glu 

Glu 

Arg Arg 

Leu 

Thr 

Phe 

Cys 

His 

Gin 





245 





250 





255 


Gly 

Leu 

Ser 

Asn 

Val 

Leu 

Asp 

Asp 

Pro 

Lys 

Ser 

Ala 

Gly 

val 

Ala 

Thr 




260 





265 





270 



Phe 

Val 

lie 

Gin 

Glu 

Glu 

Phe 

Asp 

Arg 

Phe 

Thr 

Gly 

Tyr 

Trp 

Trp 

Cys 



275 





280 





285 




Pro 

Thr 

Ala 

Ser 

Trp 

GlU 

Gly 

Ser 

Glu 

Gly 

Leu 

Lys 

Thr 

Leu 

Arg 

He 


290 





295 





300 





Leu 

Tyr 

Glu 

GlU 

val 

Asp 

Glu 

Ser 

Glu 

Val 

Glu 

Val 

lie 

His 

Val 

Pro 

305 





310 





315 





320 

Ser 

Pro 

Ala 

Leu 

Glu 

Glu 

Arg 

Lys 

Thr Asp 

Ser 

Tyr 

Arg 

Tyr 

Pro 

Arg 





325 





330 





335 


Thr 

Gly 

Ser 

Lys 

Asn 

Pro 

Lys 

lie 

Ala 

Leu 

Lys 

Leu 

Ala 

Glu 

Phe 

Gin 




340 





345 





350 



Thr 

Asp 

Ser 

Gin 

Gly 

Lys 

lie 

Val 

Ser 

Thr 

Gin 

Glu 

Lys 

Glu 

Leu 

Val 



355 





360 





365 




Gin 

Pro 

Phe 

Ser 

Ser 

Leu 

Phe 

Pro 

Lys 

Val 

Glu 

Tyr 

lie 

Ala 

Arg 

Ala 


370 





375 





380 





Gly 

Trp 

Thr 

Arg 

Asp 

Gly 

Lys 

Tyr 

Ala 

Trp 

Ala 

Met 

Phe 

Leu 

Asp 

Arg 

385 





390 





395 





400 

Pro 

Gin 

Gin 

Trp 

Leu 

Gin 

Leu 

Val 

Leu 

Leu 

Pro 

Pro 

Ala 

Leu 

Phe 

lie 





405 





410 





415 
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t 


Pro Ser Thr Glu Asn Glu Glu Gin 

420 

Pro Arg Asn Val Gin Pro Tyr Val 
435 440 

Trp lie Asn Val His Asp lie Phe 
450 455 

Glu Asp Glu Leu Cvs Phe Leu Arq 
465 470 

Cys His Leu Tyr Lys Val Thr Ala 

485 

Trp Ser Glu Pro Phe Ser Pro Gly 

500 

Lys Glu Glu lie Ala Leu Thr Ser 
515 520 

His Gly Ser Lys Gly Thr Lys Asp 
530 535 

Val Val Ser Tyr Glu Ala Ala Gly 
545 550 

Gly Phe Ser His Ser Cys Ser Met 

565 

Ser His Tyr Ser Ser Val Ser Thr 

580 

Leu Ser Gly Pro Asp Asp Asp Pro 
595 600 

Ala Ser Met Met Glu Ala Ala Ser 
610 615 

Glu lie Phe His Phe His Thr Arg 
625 630 

lie Tyr Lys Pro His Ala Leu Gin 

645 

Leu Phe Val Tyr Gly Gly Pro Gin 

660 

Lys Gly lie Lys Tyr Leu Arg Leu 
675 680 

Ala Val Val Val lie Asp Gly Arg 
690 695 

Phe Glu Gly Ala Leu Lys Asn Gin 
705 710 

Gin Val Glu Gly Leu Gin Phe Val 

725 

Leu Ser Arg Val Ala lie His Gly 

740 

Leu Met Gly Leu lie His Lys Pro 
755 760 

Gly Ala Pro Val Thr Val Trp Met 
770 775 

Arg Tyr Met Asp Val Pro Glu Asn 
785 790 

Ser Val Ala Leu His Val Glu Lys 

805 

Leu lie Leu His Gly Phe Leu Asp 

820 

Asn Phe Leu Val Ser Gin Leu He 
835 840 

Gin lie Tyr Pro Asn Glu Arg His 
850 855 

Glu His Tyr Glu Val Thr Leu Leu 
865 870 


Arg Leu Ala Ser Ala Arg Ala Val 

425 430 

Val Tyr Glu Glu Val Thr Asn Val 

445 

Tyr Pro Phe Pro Gin Ser Glu Gly 

460 

Ala Asn Glu Cys Lys Thr Gly Phe 
475 480 

Val Leu Lys Ser Gin Gly Tyr Asp 
490 495 

Glu Asp Glu Phe Lys Cys Pro lie 
505 510 

Gly Glu Trp Glu Val Leu Ala Arg 

525 

Thr Pro Leu Glu His His Leu Tyr 

540 

Glu lie Val Arg Leu Thr Thr Pro 
555 560 

Ser Gin Asn Phe Asp Met Phe Val 
570 575 

Pro Pro Cys Val His Val Tyr Lys 
585 590 

Leu His Lys Gin Pro Arg Phe Trp 

605 

Cys Pro Pro Asp Tyr Val Pro Pro 

620 

Ser Asp Val Arg Leu Tyr Gly Met 
635 640 

Pro Gly Lys Lys His Pro Thr Val 
650 655 

Val Gin Leu Val Asn Asn Ser Phe 
665 670 

Asn Thr Leu Ala Ser Leu Gly Tyr 

685 

Gly Ser Cys Gin Arg Gly Leu Arg 

700 

Met Gly Gin Val Glu lie Glu Asp 
715 720 

Ala Glu Lys Tyr Gly Phe lie Asp 
730 735 

Trp Ser Tyr Gly Gly Phe Leu Ser 
745 750 

Gin Val Phe Lys Val Ala lie Ala 

765 

Ala Tyr Asp Thr Gly Tyr Thr Glu 

780 

Asn Gin His Gly Tyr Glu Ala Gly 
795 800 

Leu Pro Asn Glu Pro Asn Arg Leu 
810 815 

Glu Asn Val His Phe Phe His Thr 
825 830 

Arg Ala Gly Lys Pro Tyr Gin Leu 

845 

Ser lie Arg Cys Pro Glu Ser Gly 

860 

His Phe Leu Gin Glu Tyr Leu 
875 
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<210> 36 

<211> 4100 
<212 > DNA 
<213> Homo sapiens 
<400> 36 

caggccgccg cctgggtcgc tcaacttccg ggtcaaaggt gcctgagccg gcgggtcccc 60 

tgtgtccgcc gcggctgtcg tcccccgctc ccgccacttc cggggtcgca gtcccgggca 120 

tggagccgcg accgtgaggc gccgctggac ccgggacgac ctgcccagtc cggccgccgc 180 

cccacgtccc ggtctgtgtc ccacgcctgc agctggaatg gaggctctct ggacccttta 240 

gaaggcaccc ctgccctcct gaggtcagct gagcggttaa tgcggaaggt taagaaactg 300 

cgcctggaca aggagaacac cggaagttgg agaagcttct cgctgaattc cgagggggct 360 

gagaggatgg ccaccaccgg gaccccaacg gccgaccgag gcgacgcagc cgccacagat 420 

gacccggccg cccgcttcca ggtgcagaag cactcgtggg acgggctccg gagcatcatc 480 

cacggcagcc gcaagtactc gggcctcatt gtcaacaagg cgccccacga cttccagttt 540 

gtgcagaaga cggatgagtc tgggccccac tcccaccgcc tctactacct gggaatgcca 600 

tatggcagcc gagagaactc cctcctctac tctgagattc ccaagaaggt ccggaaagag 660 

gctctgctgc tcctgtcctg gaagcagatg ctggatcatt tccaggccac gccccaccat 720 

ggggtctact ctcgggagga ggagctgctg agggagcgga aacgcctggg ggtcttcggc 780 

atcacctcct acgacttcca cagcgagagt ggcctcttcc tcttccaggc cagcaacagc 840 

ctcttccact gccgcgacgg cggcaagaac ggcttcatgg fcgtcccctat gaaaccgct:g 50 0 

gaaatcaaga cccagtgctc agggccccgg atggacccca aaatctgccc tgccgaccct 960 

gccttcttct ccttcatcaa taacagcgac ctgtgggtgg ccaacatcga gacaggcgag 1020 

gagcggcggc tgaccttctg ccaccaaggt ttatccaatg tcctggatga ccccaagtct 1080 

gcgggtgtgg ccaccttcgt catacaggaa gagttcgacc gcttcactgg gtactggtgg 1140 

tgccccacag cctcctggga aggttcagag ggcctcaaga cgctgcgaat cctgtatgag 1200 

gaagtcgatg agtccgaggt ggaggtcatt cacgtcccct ctcctgcgct agaagaaagg 1260 

aagacggact cgtatcggta ccccaggaca ggcagcaaga atcccaagat tgccttgaaa 1320 

ctggctgagt tccagactga cagccagggc aagatcgtct cgacccagga gaaggagctg 1380 

gtgcagccct tcagctcgct gttcccgaag gtggagtaca tcgccagggc cgggtggacc 1440 

cgggatggca aatacgcctg ggccatgttc ctggaccggc cccagcagtg gctccagctc 1500 

gtcctcctcc ccccggccct gttcatcccg agcacagaga atgaggagca gcggctagcc 1560 

tctgccagag ctgtccccag gaatgtccag ccgtatgtgg tgtacgagga ggtcaccaac 1620 

gtctggatca atgttcatga catcttctat cccttccccc aatcagaggg agaggacgag 1680 

ctctgctttc tccgcgccaa tgaatgcaag accggcttct gccatttgta caaagtcacc 1740 

gccgttttaa aatcccaggg ctacgattgg agtgagccct tcagccccgg ggaagatgaa 1800 

tttaagtgcc ccattaagga agagattgct ctgaccagcg gtgaatggga ggttttggcg i860 

aggcacggct ccaagggcac caaggacacg ccgctggagc accacctcta cgtggtcagc 1920 

tatgaggcgg ccggcgagat cgtacgcctc accacgcccg gcttctccca tagctgctcc 1980 

atgagccaga acttcgacat gttcgtcagc cactacagca gcgtgagcac gccgccctgc 2040 

gtgcacgtct acaagctgag cggccccgac gacgaccccc tgcacaagca gccccgcttc 2100 

tgggctagca tgatggaggc agccagctgc cccccggatt atgttcctcc agagatcttc 2160 

catttccaca cgcgctcgga tgtgcggctc tacggcatga tctacaagcc ccacgccttg 2220 

cagccaggga agaagcaccc caccgtcctc tttgtatatg gaggccccca ggtgcagctg 2280 

gtgaataact ccttcaaagg catcaagtac ttgcggctca acacactggc ctccctgggc 2340 

tacgccgtgg ttgtgattga cggcaggggc tcctgtcagc gagggcttcg gttcgaaggg 2400 

gccctgaaaa accaaatggg ccaggtggag atcgaggacc aggtggaggg cctgcagttc 2460 

gtggccgaga agtatggctt catcgacctg agccgagttg ccatccatgg ctggtcctac 2520 

gggggcttce tctcgctcat ggggctaatc cacaagcccc aggtgttcaa ggtggccatc 2580 

gcgggtgccc cggtcaccgt ctggatggcc tacgacacag ggtacactga gcgctacatg 2640 

gacgtccctg agaacaacca gcacggctat gaggcgggtt ccgtggccct gcacgtggag 2700 

aagctgccca atgagcccaa ccgcttgctt atcctccacg gcttcctgga cgaaaacgtg 2760 

cactttttcc acacaaactt cctcgtctcc caactgatcc gagcagggaa accttaccag 2820 

ctccagatct accccaacga gagacacagt attcgctgcc ccgagtcggg cgagcactat 2880 

gaagtcacgt tgctgcactt tctacaggaa tacctctgag cctgcccacc gggagccgcc 2940 

acatcacagc acaagtggct gcagcctccg cggggaacca ggcgggaggg actgagtggc 3000 

ccgcgggccc cagtgaggca ctttgtcccg cccagcgctg gccagccccg aggagccgct 3060 

gccttcaccg ccccgacgcc ttttatcctt ttttaaacgc tcttgggttt tatgtccgct 3120 

gcttcttggt tgccgagaca gagagatggt ggtctcgggc cagcccctcc tctccccgcc 3180 
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ttctgggagg aggaggtcac acgctgatgg 
agcgggctgc cttccgcctg gggctccctg 
accgtcccca gcacccaagc atgcaattgc 
atgtttgtgt tttgtttggg gggatatttt 
ggtggctcac gtctgtaatc ccagcacttt 
ttgggagttc aagaccagcc tggccaacat 
aaaattagcc gggtgtggtg gcgcgtgcct 
ggagaatcgc ttgaacccgg gaggtggagg 
tccagcctgg gcaacaagag cgaaactctg 
aagcaagggg tgcctaaatc tagacttggg 
gcacctggta ggctccattt cttcccaagc 
gaacttccac gctctgctgg tcagtggcgg 
acatgtgtct gcctgacccg tacacaccag 
cacctcaggg ttatatttcc ctctcccctt 
ggcaaaaaaa aaagtaaaaa gaaaagaaaa 
attatggaaa gaaaatattt ttgtcgattc 
tagacacatt aaacgattcc 


<210> 37 

<211> 819 

<212> PRT 


<213> Homo 
<400> 37 

sapiens 




Met 

1 

Arg 

Lys 

val 

Lys 

5 

Lys 

Leu 

Arg 

Trp 

Arg 

Ser 

Phe 

20 

Ser 

Leu 

Asn 

Ser 

Thr 

Gly 

Thr 

35 

Pro 

Thr 

Ala 

Asp 

Arg 

40 

Pro 

Ala 

50 

Ala 

Arg 

Phe 

Gin 

val 

55 

Gin 

Ser 

65 

lie 

He 

His 

Gly 

Ser 

70 

Arg 

Lys 

Ala 

Pro 

His 

Asp 

Phe 

85 

Gin 

Phe 

val 

His 

Ser 

His 

Arg 

100 

Leu 

Tyr 

Tyr 

Leu 

Asn 

Ser 

Leu 

115 

Leu 

Tyr 

Ser 

Glu 

lie 

120 

Leu 

Leu 

130 

Leu 

Leu 

Ser 

Trp 

Lys 

135 

Gin 

Pro 

145 

His 

His 

Gly 

Val 

Tyr 

150 

Ser 

Arg 

Lys 

Arg 

Leu 

Gly 

Val 

165 

Phe 

Gly 

lie 

Ser 

Gly 

Leu 

Phe 

180 

Leu 

Phe 

Gin 

Ala 

Asp 

Gly 

Gly 

195 

Lys 

Asn 

Gly 

Phe 

Met 

200 

lie 

Lys 

210 

Thr 

Gin 

Cys 

Ser 

Gly 

215 

Pro 

Ala 

225 

Asp 

Pro 

Ala 

Phe 

Phe 

230 

Ser 

Phe 

Ala 

Asn 

lie 

Glu 

Thr 

245 

Gly 

Glu 

Glu 

Gly 

Leu 

Ser 

Asn 

260 

Val 

Leu 

Asp 

Asp 


gcactggaga ggccagaaga gactcagagg 
tgacctctca gtcccctggc ccggccagcc 
ctgtcccccc cggccagcct ccccaacttg 
tcataattat ttaaaagaca ggccgggcgc 
gggaggctga ggcgggcgga tcacctgagg 
ggggaaaccc cgtctctact aaaaatacaa 
ataatcccag ctactcggga ggctgaggca 
ttgcggtgag ccaagatcgc accattgcac 
tctcaaaata aataaaaaat aaaagacaga 
gtccacaccg ggcagcgggg ttgcaaccca 
ccgactttca ggcaggcact gaaacgcacc 
ctgtcccctc cccagcccag ccgcccagcc 
gggttccggg gttgggagct gaaccatccc 
ccctccccgc caagagctct gccaggggcg 
aaaaaaaaaa gaaacaaacc acctctacat 
ttattctttt ataattatgc gtggaagaag 
atgtcacctg 


Leu Asp 
10 

Lys 

Glu 

Asn 

Thr 

Gly 

15 

Ser 

Glu Gly 
25 

Ala 

Glu 

Arg 

Met 

30 

Ala 

Thr 

Gly Asp 

Ala 

Ala 

Ala 

45 

Thr 

Asp 

Asp 

Lys His 

Ser 

Trp 

60 

Asp 

Gly 

Leu 

Arg 

Tyr Ser 

Gly 

75 

Leu 

lie 

Val 

Asn 

Lys 

80 

Gin Lys 
90 

Thr 

Asp 

Glu 

Ser 

Gly 

95 

Pro 

Gly Met 
105 

Pro 

Tyr 

Gly 

Ser 

110 

Arg 

Glu 

Pro Lys 

Lys 

Val 

Arg 

125 

Lys 

Glu 

Ala 

Met Leu 

Asp 

His 

140 

Phe 

Gin 

Ala 

Thr 

Glu Glu 

GlU 

155 

Leu 

Leu 

Arg 

Glu 

Arg 

160 

Thr Ser 
170 

Tyr 

Asp 

Phe 

His 

Ser 

175 

Glu 

Ser Asn 

185 

Ser 

Leu 

Phe 

His 

190 

Cys 

Arg 

Val Ser 

Pro 

Met 

Lys 

205 

Pro 

Leu 

Glu 

Arg Met 

Asp 

Pro 

220 

Lys 

lie 

Cys 

Pro 

lie Asn 

Asn 

235 

Ser 

Asp 

Leu 

Trp 

Val 

240 

Arg Arg 
250 

Leu 

Thr 

Phe 

Cys 

His 

255 

Gin 

Pro Lys 
265 

Ser 

Ala 

Gly 

Val 

270 

Ala 

Thr 


3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4180 
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Phe Val 

lie 

Gin 

Glu 

Glu 

Phe Asp Arg Phe 

Thr 

Gly Tyr 

Trp 
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Trp Cys 

Pro 

Thr 

275 

Ala 

Ser 

Trp 

280 

Glu Gly Ser Glu Gly 

Leu 

Lys 

285 

Thr 

Leu Arg lie 

Leu 

290 

Tyr 

Glu 

Glu 

Val 

Asp 

295 

Glu Ser Glu Val 

Glu 

300 

Val 

lie 

His 

Val Pro 

305 

Ser 

Pro 

Ala 

Leu 

Glu 

310 

Glu Arg Lys Thr Asp 

315 

Ser 

Tyr Arg 

Tyr 

320 

Pro Arg 

Thr 

Gly 

Ser 

Lys 

325 

Asn 

Pro 

330 

Lys lie Ala Leu 

Lys 

Leu 

Ala 

Glu 

335 

Phe Gin 

Thr 

Asp 

Ser 

340 

Gin Gly 

Lys 

345 

lie Val Ser Thr 

Gin 

GlU 

Lys 

350 

GlU 

Leu Val 

Gin 

Pro 

355 

Phe 

Ser 

Ser 

Leu 

360 

Phe Pro Lys Val 

GlU 

Tyr 

365 

lie 

Ala 

Arg Ala 

Gly 

370 

Trp 

Thr 

Arg 

Asp 

375 

Gly Lys Tyr Ala Trp 

Ala 

380 

Met 

Phe 

Leu 

Asp Arg 

385 

Pro 

Gin 

Gin 

Trp 

Leu 

390 

Gin 

Leu Val Leu Leu 

395 

Pro 

Pro 

Ala 

Leu 

400 

Phe lie 

Pro 

Ser 

Thr 

Glu 

405 

Asn 

Glu 

410 

Glu Gin Arg Leu 

Ala 

Ser 

Ala 

415 

Arg Ala Val 




420 


n 

425 

(TUr* TT-tl TT-.1 

r> 1 

r*T «« 

T7-» T 

430 

rpU >*- 

Tt «« t r-» 1 

PiO 


nou 

v ax 

uxu 

tr a. w 

x j v ax v ax 4 jf ju 

W X U 

wx u 

v a x 

X U4 

rvo v va 

Trp 

lie 

435 

Asn 

Val 

His 

ASp 

440 

He Phe Tyr Pro 

Phe 

Pro 

445 

Gin 

Ser 

Glu Gly 

Glu 

450 

Asp 

Glu 

Leu 

Cys 

Phe 

455 

Leu Arg Ala Asn 

Glu 

460 

Cys 

Lys 

Thr 

Gly Phe 

465 

Cys 

His 

Leu 

Tyr 

Lys 

470 

Val 

Thr Ala Val Leu 

475 

Lys 

Ser 

Gin 

480 

Gly Tyr Asp 

Trp 

Ser 

Glu 

Pro 

485 

Phe 

Ser 

490 

Pro Gly Glu Asp 

Glu 

Phe 

Lys 

Cys 

495 

Pro lie 

Lys 

Glu 

Glu 

500 

lie 

Ala 

Leu 

505 

Thr Ser Gly Glu 

Trp 

Glu 

Val 

510 

Leu 

Ala Arg 

His 

Gly 

515 

Ser 

Lys 

Gly 

Thr 

520 

Lys Asp Thr Pro 

Leu 

Glu 

525 

His 

His 

Leu Tyr 

Val 

530 

val 

Ser 

Tyr 

Glu 

Ala 

535 

Ala Gly Glu lie 

Val 

540 

Arg 

Leu 

Thr 

Thr Pro 

545 

Gly 

Phe 

Ser 

His 

Ser 

550 

Cys 

Ser Met Ser Gin 

555 

Asn 

Phe 

Asp 

Met 

560 

Phe Val 

Ser 

His 

Tyr 

Ser 

565 

Ser 

val 

570 

Ser Thr Pro Pro 

Cys 

Val 

His 

Val 

575 

Tyr Lys 

Leu 

Ser 

Gly 

580 

Pro 

Asp 

585 

Asp Asp Pro Leu His 

Lys 

Gin 

Pro 

590 

Arg 

Phe Trp 

Ala 

Ser 

595 

Met 

Met 

Glu 

Ala 

600 

Ala Ser Cys Pro 

Pro 

Asp 

605 

Tyr 

val 

Pro Pro 

Glu 

610 

lie 

Phe 

His 

Phe 

His 

615 

Thr Arg Ser Asp 

val 

620 

Arg 

Leu 

Tyr 

Gly Met 

625 

lie 

Tyr 

Lys 

Pro 

His 

630 

Ala 

635 

Leu Gin Pro Gly Lys 

Lys 

His 

Pro 

640 

Thr Val 

Leu 

Phe 

Val 

Tyr 

645 

Gly 

Gly 

650 

Pro Gin Val Gin 

Leu 

Val 

Asn 

Asn 

655 

Ser Phe 

Lys 

Gly 

lie 

660 

Lys 

Tyr 

Leu 

665 

Arg Leu Asn Thr 

Leu 

Ala 

Ser 

670 

Leu 

Gly Tyr 

Ala 

val 

675 

Val 

val 

lie 

Asp 

680 

Gly Arg Gly Ser 

Cys 

Gin 

685 

Arg 

Gly Leu Arg 

Phe 

690 

Glu 

Gly 

Ala 

Leu 

Lys 

695 

Asn Gin Met Gly Gin 

700 

val 

Glu 

lie 

Glu Asp 

70S 

Gin 

Val 

Glu 

Gly 

Leu 

710 

Gin 

Phe Val Ala Glu 

715 

Lys 

Tyr 

Gly 

Phe 

720 
lie Asp 




725 


730 





735 
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Leu Ser Arg Val Ala lie His Gly Trp Ser Tyr Gly Gly Phe Leu Ser 

740 745 750 

Leu Met Gly Leu lie His Lys Pro Gin Val Phe Lys Ala Gin Pro Leu 
755 760 765 

Ala Tyr Pro Pro Arg Leu Pro Gly Arg Lys Arg Ala Leu Phe Pro His 
770 775 780 

Lys Leu Pro Arg Leu Pro Thr Asp Pro Ser Arg Glu Thr Leu Pro Ala 

785 790 795 800 

Pro Asp Leu Pro Gin Arg Glu Thr Gin Tyr Ser Leu Pro Arg Val Gly 

805 810 815 

Arg Ala Leu 

<210> 38 

<211> 4120 

<212> DNA 

<213 > Homo sapiens 

<400> 38 

caggccgccg cctgggtcgc tcaacttccg ggtcaaaggt gcctgagccg gcgggtcccc 60 

tgtgtcegcc gcggctgtcg tcccocgctc ccgccacttc cggggtcgca gtcccgggca 120 

tggagccgcg accgtgaggc gccgctggac ccgggacgac ctgcccagtc cggccgccgc 180 

cccacgtccc ggtctgtgtc ccacgcctgc agctggaatg gaggctctct ggacccttta 240 

gaaggcaccc ctgccctcct gaggtcagct gagcggttaa tgcggaaggt taagaaactg 300 

cgcctggaca aggagaacac cggaagttgg agaagcttct cgctgaattc cgagggggct 360 

gagaggatgg ccaccaccgg gaccccaacg gccgaccgag gcgacgcagc cgccacagat 420 

gacccggccg cccgcttcca ggtgcagaag cactcgtggg acgggctccg gagcatcatc 480 

cacggcagcc gcaagtactc gggcctcatt gtcaacaagg cgccccacga cttccagttt 540 

gtgcagaaga cggatgagtc tgggccccac tcccaccgcc tctactacct gggaatgcca 600 

tatggcagcc gagagaactc cctcctctac tctgagattc ccaagaaggt ccggaaagag 660 

gctctgctgc tcctgtcctg gaagcagatg ctggatcatt tccaggccac gccccaccat 720 

ggggtctact ctcgggagga ggagctgctg agggagcgga aacgcctggg ggtcttcggc 7 80 

atcacctcct acgacttcca cagcgagagt ggcctcttcc tcttccaggc cagcaacagc 840 

ctcttccact gccgcgacgg cggcaagaac ggcttcatgg tgtcccctat gaaaccgctg 900 

gaaatcaaga cccagtgctc agggccccgg atggacccca aaatctgccc tgccgaccct 960 

gccttcttct ccttcatcaa taacagcgac ctgtgggtgg ccaacatcga gacaggcgag 1020 

gagcggcggc tgaccttctg ccaccaaggt ttatccaatg tcctggatga ccccaagtct 1080 

gcgggtgtgg ccaccttcgt catacaggaa gagttcgacc gcttcactgg gtactggtgg 1140 

tgccccacag cctcctggga aggttcagag ggcctcaaga cgctgcgaat cctgtatgag 1200 

gaagtcgatg agtccgaggt ggaggtcatt cacgtcccct ctcctgcgct agaagaaagg 1260 

aagacggact cgtatcggta ccccaggaca ggcagcaaga atcccaagat tgccttgaaa 1320 

ctggctgagt tccagactga cagccagggc aagatcgtct cgacccagga gaaggagctg 1380 

gtgcagccct tcagctcgct gttcccgaag gtggagtaca tcgccagggc cgggtggacc 1440 

cgggatggca aatacgcctg ggccatgttc ctggaccggc cccagcagtg gctccagctc 1500 

gtcctcctcc ccccggccct gttcatcccg agcacagaga atgaggagca gcggctagcc 1560 

tctgccagag ctgtccccag gaatgtccag ccgtatgtgg tgtacgagga ggtcaccaac 162 0 

gtctggatca atgttcatga catcttctat cccttccccc aatcagaggg agaggacgag 1680 

ctctgctttc tccgcgccaa tgaatgcaag accggcttct gccatttgta caaagtcacc 1740 

gccgttttaa aatcccaggg ctacgattgg agtgagccct tcagccccgg ggaagatgaa 1800 

tttaagtgcc ccattaagga agagattgct ctgaccagcg gtgaatggga ggttttggcg 1860 

aggcacggct ccaagggcac caaggacacg ccgctggagc accacctcta cgtggtcagc 1920 

tatgaggcgg ccggcgagat cgtacgcctc accacgcccg gcttctccca tagctgctcc i960 

atgagccaga acttcgacat gttcgtcagc cactacagca gcgtgagcac gccgccctgc 2040 

gtgcacgtct acaagctgag cggccccgac gacgaccccc tgcacaagca gccccgcttc 2100 

tgggctagca tgatggaggc agccagctgc cccccggatt atgttcctcc agagatcttc 2160 

catttccaca cgcgctcgga tgtgcggctc tacggcatga tctacaagcc ccacgccttg 2220 

cagccaggga agaagcaccc caccgtcctc tttgtatatg gaggccccca ggtgcagctg 2280 

gtgaataact ccttcaaagg catcaagtac ttgcggctca acacactggc ctccctgggc 2340 

tacgccgtgg ttgtgattga cggcaggggc tcctgtcagc gagggcttcg gttcgaaggg 2400 

gccctgaaaa accaaatggg ccaggtggag atcgaggacc aggtggaggg cctgcagttc 2460 
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gtggccgaga agtatggctt catcgacctg 
gggggcttcc tctcgctcat ggggctaatc 
cttgcttatc ctccacggct tcctggacga 
cgtctcccaa ctgatccgag cagggaaacc 
acacagtatt cgctgccccg agtcgggcga 
acaggaatac ctctgagcct gcccaccggg 
gcctccgcgg ggaaccaggc gggagggact 
tgtcccgccc agcgctggcc agccccgagg 
tatccttttt taaacgctct tgggttttat 
agatggtggt ctcgggccag cccctcctct 
ctgatgggca ctggagaggc cagaagagac 
ctccctgtga cctctcagtc ccctggcccg 
caattgcctg tcccccccgg ccagcctccc 
atatttttca taattattta aaagacaggc 
gcactttggg aggctgaggc gggcggatca 
ccaacatggg gaaaccccgt ctctactaaa 
cgtgcctata atcccagcta ctcgggaggc 
gtggaggttg cggtgagcca agatcgcacc 
aactctgtct caaaataaat aaaaaataaa 
acttggggtc cacaccgggc agcggggttg 
cccaagcccg agcagaggyt catycgggcc 
ggcaccacct gtggacagcc ctcctgtccc 
gaacttccac gctctgctgg tcagtggcgg 
acatgtgtct gcctgacccg tacacaccag 
cacctcaggg ttatatttcc ctctcccctt 
ggcaaaaaaa aaagtaaaaa gaaaagaaaa 
attatggaaa gaaaatattt ttgtcgattc 
tagacacatt aaacgattcc agttggaaac 


<210> 

<211> 

<212> 

<213 > ] 

<400> 

39 

819 

PRT 

Homo 

39 

sapiens 




Met 

1 

Arg 

Lys 

Val 

Lys 

5 

Lys 

Leu 

Arg 

Trp 

Arg 

Ser 

Phe 

20 

Ser 

Leu 

Asn 

Ser 

Thr 

Gly 

Thr 

35 

Pro 

Thr 

Ala 

Asp 

Arg 

40 

Pro 

Ala 

50 

Ala 

Arg 

Phe 

Gin 

Val 

55 

Gin 

Ser 

65 

He 

He 

His 

Gly 

Ser 

70 

Arg 

Lys 

Ala 

Pro 

His 

Asp 

Phe 

85 

Gin 

Phe 

Val 

His 

Ser 

His 

Arg 

100 

Leu 

Tyr 

Tyr 

Leu 

Asn 

Ser 

Leu 

115 

Leu 

Tyr 

Ser 

Glu 

lie 

120 

Leu 

Leu 

130 

Leu 

Leu 

Ser 

Trp 

Lys 

135 

Gin 

Pro 

145 

His 

His 

Gly 

Val 

Tyr 

150 

Ser 

Arg 

Lys 

Arg 

Leu 

Gly 

val 

165 

Phe 

Gly 

lie 

Ser 

Gly 

Leu 

Phe 

180 

Leu 

Phe 

Gin 

Ala 


agccgagttg ccatccatgg ctggtcctac 
cacaagcccc aggtgttcaa ggcccaaccg 
aaacgtgcac tttttccaca caaacttcct 
ttaccagctc cagatctacc ccaacgagag 
gcactatgaa gtcacgttgc tgcactttct 
agccgccaca tcacagcaca agtggctgca 
gagtggcccg cgggccccag tgaggcactt 
agccgctgcc ttcaccgccc cgacgccttt 
gtccgctgct tcttggttgc cgagacagag 
ccccgccttc tgggaggagg aggtcacacg 
tcagaggagc gggctgcctt ccgcctgggg 
gccagccacc gtccccagca cccaagcatg 
caacttgatg tttgtgtttt gtttgggggg 
cgggcgcggt ggctcacgtc tgtaatccca 
cctgaggttg ggagttcaag accagcctgg 
aatacaaaaa attagccggg tgtggtggcg 
tgaggcagga gaatcgcttg aacccgggag 
attgcactcc agcctgggca acaagagcga 
agacagaaag caaggggtgc ctaaatctag 
caacccagca cctggtaggc tccatttctt 
ccacaggaga agcggccagg gcccgcgggg 
caagctttca ggcaggcact gaaacgcacc 
ctgtcccctc cccagcccag ccgcccagcc 
gggttccggg gttgggagct gaaccatccc 
ccctccccgc caagagctct gccaggggcg 
aaaaaaaaaa gaaacaaacc acctctacat 
ttattctttt ataattatgc gtggaagaag 
atgtcacctg 


Leu 

Asp 

10 

Lys 

Glu 

Asn 

Thr 

Gly 

15 

Ser 

Glu 

25 

Gly 

Ala 

Glu 

Arg 

Met 

30 

Ala 

Thr 

Gly 

Asp 

Ala 

Ala 

Ala 

45 

Thr 

Asp 

Asp 

Lys 

His 

Ser 

Trp 

60 

Asp 

Gly 

Leu 

Arg 

Tyr 

Ser 

Gly 

75 

Leu 

lie 

Val 

Asn 

Lys 

80 

Gin 

Lys 

90 

Thr 

Asp 

Glu 

Ser 

Gly 

95 

Pro 

Gly 

105 

Met 

Pro 

Tyr 

Gly 

Ser 

110 

Arg 

Glu 

Pro 

Lys 

Lys 

Val 

Arg 

125 

Lys 

Glu 

Ala 

Met 

Leu 

Asp 

His 

140 

Phe 

Gin 

Ala 

Thr 

Glu 

Glu 

Glu 

155 

Leu 

Leu 

Arg 

Glu 

Arg 

160 

Thr 

Ser 

170 

Tyr 

Asp 

Phe 

His 

Ser 

175 

Glu 

Ser 

185 

Asn 

Ser 

Leu 

Phe 

His 

190 

Cys 

Arg 


2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

■*» n n n 

J I A V 

3780 

3840 

3900 

3960 

4020 

4080 

4120 
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Asp 

Gly 

Gly 

Lys 

Asn 

Gly 

Phe 

Met 

val 

Ser 

Pro 

Met 

Lys 

Pro 

Leu 

Glu 



195 





200 





205 




lie 

Lys 

Thr 

Gin 

Cys 

Ser 

Gly 

Pro 

Arg 

Met 

Asp 

Pro 

Lys 

lie 

Cys 

Pro 


210 





215 





220 





Ala 

Asp 

Pro 

Ala 

Phe 

Phe 

Ser 

Phe 

lie 

Asn 

Asn 

Ser 

Asp 

Leu 

Trp 

val 

225 





230 





235 





240 

Ala 

Asn 

lie 

Glu 

Thr 

Gly 

Glu 

Glu 

Arg 

Arg 

Leu 

Thr 

Phe 

Cys 

His 

Gin 





245 





250 





255 


Gly 

Leu 

Ser 

Asn 

val 

Leu 

Asp 

Asp 

Pro 

Lys 

Ser 

Ala 

Gly 

Val 

Ala 

Thr 




260 





265 





270 



Fhe 

Val 

lie 

Gin 

Glu 

Glu 

Phe 

Asp 

Arg 

Phe 

Thr 

Gly 

Tyr 

Trp 

Trp 

Cys 



275 





280 





285 




Pro 

Thr 

Ala 

Ser 

Trp 

Glu 

Gly 

Ser 

Glu 

Gly 

Leu 

Lys 

Thr 

Leu 

Arg 

lie 


290 





295 





300 





Leu 

Tyr 

Glu 

Glu 

Val 

Asp 

Glu 

Ser 

Glu 

Val 

Glu 

Val 

lie 

His 

Val 

Pro 

305 





310 





315 





320 

Ser 

Pro 

Ala 

Leu 

Glu 

Glu 

Arg 

Lys 

Thr 

Asp 

Ser 

Tyr 

Arg 

Tyr 

Pro 

Arg 





325 





330 





335 


Thr 

Gly 

Ser 

Lys 

Asn 

Pro 

Lys 

lie 

Ala 

Leu 

Lys 

Leu 

Ala 

Glu 

Phe 

Gin 




340 





345 





350 



Thr 

Asp 

Ser 

Gin 

Gly 

Lys 

lie 

vai 

Ser 

Thr 

Gin 

Giu 

Lys 

Glu 

Leu 

Val 



355 





360 





365 




Gin 

Pro 

Phe 

Ser 

Ser 

Leu 

Phe 

Pro 

Lys 

Val 

Glu 

Tyr 

He 

Ala 

Arg 

Ala 


370 





375 





380 





Gly 

Trp 

Thr 

Arg 

Asp 

Gly 

Lys 

Tyr 

Ala 

Trp 

Ala 

Met 

Phe 

Leu 

Asp 

Arg 

385 





390 





395 





400 

Pro 

Gin 

Gin 

Trp 

Leu 

Gin 

Leu 

Val 

Leu 

Leu 

Pro 

Pro 

Ala 

Leu 

Phe 

lie 





405 





410 





415 


Pro 

Ser 

Thr 

Glu 

Asn 

Glu 

GlU 

Gin 

Arg 

Leu 

Ala 

Ser 

Ala 

Arg 

Ala 

Val 




420 





425 





430 



Pro 

Arg 

Asn 

Val 

Gin 

Pro 

Tyr 

Val 

Val 

Tyr 

Glu 

Glu 

Val 

Thr 

Asn 

val 



435 





440 





445 




Trp 

lie 

Asn 

Val 

His 

Asp 

lie 

Phe 

Tyr 

Pro 

Phe 

Pro 

Gin 

Ser 

Glu 

Gly 


450 





455 





460 





Glu 

Asp 

Glu 

Leu 

Cys 

Phe 

Leu 

Arg 

Ala 

Asn 

Glu 

Cys 

Lys 

Thr 

Gly 

Phe 

465 





470 





475 





480 

Cys 

His 

Leu 

Tyr 

Lys 

Val 

Thr 

Ala 

Val 

Leu 

Lys 

Ser 

Gin 

Gly 

Tyr 

Asp 





485 





490 





495 


Trp 

Ser 

Glu 

Pro 

Phe 

Ser 

Pro 

Gly 

Glu 

Asp 

Glu 

Phe 

Lys 

Cys 

Pro 

lie 




500 





505 





510 



Lys 

Glu 

Glu 

lie 

Ala 

Leu 

Thr 

Ser 

Gly 

Glu 

Trp 

Glu 

Val 

Leu 

Ala 

Arg 



515 





520 





525 




His 

Gly 

Ser 

Lys 

Gly 

Thr 

Lys 

Asp 

Thr 

Pro 

Leu 

Glu 

His 

His 

Leu 

Tyr 


530 





535 





540 





Val 

val 

Ser 

Tyr 

Glu 

Ala 

Ala 

Gly 

Glu 

lie 

Val 

Arg 

Leu 

Thr 

Thr 

Pro 

545 





550 





555 





560 

Gly 

Phe 

Ser 

His 

Ser 

Cys 

Ser 

Met 

Ser 

Gin 

Asn 

Phe 

Asp 

Met 

Phe 

Val 





565 





570 





575 


Ser 

His 

Tyr 

Ser 

Ser 

Val 

Ser 

Thr 

Pro 

Pro 

Cys 

Val 

His 

Val 

Tyr 

Lys 




580 





585 





590 



Leu 

Ser 

Gly 

Pro 

Asp 

Asp 

Asp 

Pro 

Leu 

His 

Lys 

Gin 

Pro 

Arg 

Phe 

Trp 



595 





600 





605 




Ala 

Ser 

Met 

Met 

Glu 

Ala 

Ala 

Ser 

Cys 

Pro 

Pro 

Asp 

Tyr 

val 

Pro 

Pro 


610 





615 





620 





Glu 

He 

Phe 

His 

Phe 

His 

Thr 

Arg 

Ser 

Asp 

Val 

Arg 

Leu 

Tyr 

Gly 

Met 

625 





630 





635 





640 

lie 

Tyr 

Lys 

Pro 

His 

Ala 

Leu 

Gin 

Pro 

Gly 

Lys 

Lys 

His 

Pro 

Thr 

Val 





645 





650 





655 




WO 02/31134 


PCT/USO 1/31874 


Leu 

Lys 

Ala 

Phe 

705 

Gin 

Leu 

Leu 

Ala 


Phe Val Tyr 
€60 

Gly He Lys 
675 

Val Val Val 
690 

Glu Gly Ala 


Gly Gly Pro Gin Val Gin Leu 

665 

Tyr Leu Arg Leu Asn Thr Leu 

680 

lie Asp Gly Arg Gly Ser Cys 
695 

Leu Lys Asn Gin 


Val Asn 


Asn Ser Phe 
670 

Leu Gly Tyr 


710 

Val Glu Gly Leu Gin Phe Val 

725 

Ala He His Gly 


Lys 

785 

Pro 


Ser Arg Val 
740 

Met Gly Leu 
755 

Tyr Pro Pro 
770 

Leu Pro Arg 


Asp Leu Pro 


Arg Ala Leu 


He His Lys Pro 

760 

Arg Leu Pro Gly 
775 

Leu Pro Thr Asp 
790 

Gin Arg Glu Thr 
805 


Met Gly Gin 
715 

Ala Glu Lys 
730 

Trp Ser Tyr 
745 

Gin Val Phe 


Ala Ser 
685 

Gin Arg Gly Leu Arg 
700 

Val Glu 


Tyr Gly 
Gly Gly 


Lys Ala 
765 

Arg Lys Arg Ala Leu 

780 

Glu Thr 


Pro Ser Ara 
795 

Gin Tyr Ser 


Leu Pro 


n i a 
UlU 


lie Glu Asp 
720 

Phe He Asp 
735 

Phe Leu Ser 
750 

Gin Pro Leu 

Phe Pro His 

Leu Pro Ala 
800 

Arg Val Gly 
815 


< 210 > 
< 211 > 
<212 > 
<213 > 
<4 00> 


40 

4037 

DNA 

Homo sapiens 
40 


caggccgccg cctgggtcgc 
tgtgtccgcc gcggctgtcg tcccccgctc 
tggagccgcg accgtgaggc gccgctggac 
cccacgtccc ggtctgtgtc ccacgcctgc 
gaaggcaccc ctgccctcct gaggtcagct 
cgcctggaca aggagaacac cggaagttgg 
gagaggatgg ccaccaccgg gaccccaacg 
gacccggccg cccgcttcca ggtgcagaag 
cacggcagcc gcaagtactc gggcctcatt 
gtgcagaaga cggatgagtc tgggccccac 
tatggcagcc gagagaactc cctcctctac 
gctctgctgc tcctgtcctg gaagcagatg 
£T9ggtctact ctcgggagga ggagctgctg 
atcacctcct acgacttcca cagcgagagt 
ctcttccact gccgcgacgg cggcaagaac 
gaaatcaaga cccagtgctc agggccccgg 
gccttcttct ccttcatcaa taacagcgac 
9 a G c 99 c ggc tgaccttctg ccaccaaggt 
gcgggtgtgg ccaccttcgt catacaggaa 
tgccccacag cctcctggga aggttcagag 
gaagtcgatg agtccgaggt ggaggtcatt 
aagacggact cgtatcggta ccccaggaca 
ctggctgagt tccagactga cagccagggc 
gtgcagccct tcagctcgct gttcccgaag 
cgggatggca aatacgcctg ggccatgttc 
gtcctcctcc ccccggccct gttcatcccg 
tctgccagag ctgtccccag gaatgtccag 
gtctggatca atgttcatga catcttctat 
ctctgctttc tccgcgccaa tgaatgcaag 
gccgttttaa aatcccaggg ctacgattgg 
tttaagtgcc ccattaagga agagattgct 


ggtcaaaggt gcctgagccg gcgggtcccc 
ccgccacttc cggggtcgca gtcccgggca 
ccgggacgac ctgcccagtc cggccgccgc 
agctggaatg gaggctctct ggacccttta 
gagcggttaa tgcggaaggt taagaaactg 
agaagcttct cgctgaattc cgagggggct 
gccgaccgag gcgacgcagc cgccacagat 
cactcgtggg acgggctccg gagcatcatc 
gtcaacaagg cgccccacga cttccagttt 
tcccaccgcc tctactacct gggaatgcca 
tctgagattc ccaagaaggt ccggaaagag 
ct 99*tcatt tccaggccac gccccaccat 
agggagcgga aacgcctggg ggtcttcggc 
ggcctcttcc tcttccaggc cagcaacagc 
ggcttcatgg tgtcccctat gaaaccgctg 
at 99*cccca aaatctgccc tgccgaccct 
ctgtgggtgg ccaacatcga gacaggcgag 
ttatccaatg tcctggatga ccccaagtct 
gagttcgacc gcttcactgg gtactggtgg 
ggcctcaaga cgctgcgaat cctgtatgag 
cacgtcccct ctcctgcgct agaagaaagg 
ggcagcaaga atcccaagat tgccttgaaa 
aagatcgtct cgacccagga gaaggagctg 
gtggagtaca tcgccagggc cgggtggacc 
ctggaccggc cccagcagtg gctccagctc 
agcacagaga atgaggagca gcggctagcc 
ccgtatgtgg tgtacgagga ggtcaccaac 
cccttccccc aatcagaggg agaggacgag 
accggcttct gccatttgta caaagtcac? 
agtgagccct tcagccccgg ggaagatgaa 
ctgaccagcg gtgaatggga ggttttggcg 
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a ggcacggct ccaagggcac caaggacacg ccgctggagc accacctcta cgtggtcagc 
tatgaggcgg ccggcgagat cgtacgcctc accacgcccg gcttctccca tagctgctcc 
atgagccaga acttcgacat gttcgtcagc cactacagca gcgtgagcac gccgccctgc 
gtgcacgtct acaagctgag cggccccgac gacgaccccc tgcacaagca gccccgcttc 

tgatggaggc agccagctgc cccccggatt atgttcctcc agagatcttc 
catttccaca cgcgctcgga tgtgcggctc tacggcatga tctacaagcc ccacgccttg 
cagccaggga agaagcaccc caccgtcctc tttgtatatg gaggccccca ggtgcagctg 
gtgaataact ccttcaaagg catcaagtac ttgcggctca acacactggc ctccctgggc 
tacgccgtgg ttgtgattga cggcaggggc tcctgtcagc gagggcttcg gttcgaaggg 
gccctgaaaa accaaatggg ccaggtggag atcgaggacc aggtggaggg cctgcagttc 
gtggccgaga agtatggctt catcgacctg agccgagttg ccatccatgg ctggtcctac 
gggggcttcc tctcgctcat ggggctaatc cacaagcccc aggtgttcaa ggcccaaccg 
cttgcttatc ctccacggct tcctggacga aaacgtgcac tttttccaca caaacttcct 
cgtctcccaa ctgatccgag cagggaaacc ttaccagctc cagatctacc ccaacgagag 
acacagtatt cgctgccccg agtcgggcga gcactatgaa gtcacgttgc tgcactttct 
acaggaatac ctctgagcct gcccaccggg agccgccaca tcacagcaca agtggctgca 
gcctccgcgg ggaaccaggc gggagggact gagtggcccg cgggccccag tgaggcactt 
tgtcccgccc agcgctggcc agccccgagg agccgctgcc ttcaccgccc cgacgccttt 
tatccttttt taaacgctct tgggttttat gtccgctgct tcttggttgc cgagacagag 
agatggtggt ctcgggccag cccctcctct ccccgccttc tgggaggagg aggtcacacg 
ctgatgggca ctggagaggc cagaagagac tcagaggagc gggctgcctt ccgcctgggg 
ctccctgtga cctctcagtc ccctggcccg gccagccacc gtccccagca cccaagcatg 
caattgcctg tcccccccgg ccagcctccc caacttgatg tttgtgtttt gtttgggggg 
atatttttca taattattta aaagacaggc cgggcgcggt ggctcacgtc tgtaatccca 
gcactttggg aggctgaggc gggcggatca cctgaggttg ggagttcaag accagcctgg 
ccaacatggg gaaaccccgt ctctactaaa aatacaaaaa attagccggg tgtggtggcg 
cgtgcctata atcccagcta ctcgggaggc tgaggcagga gaatcgcttg aacccgggag 
gtggaggttg cggtgagcca agatcgcacc attgcactcc agcctgggca acaagagcga 
aactctgtct caaaataaat aaaaaataaa agacagaaag caaggggtgc ctaaatctag 
acttggggtc cacaccgggc agcggggttg caacccagca cctggtaggc tccatttctt 
cccaagcccg actttcaggc aggcactgaa acgcaccgaa cttccacgct ctgctggtca 
gtggcggctg tcccctcccc agcccagccg cccagccaca tgtgtctgcc tgacccgtac 
acaccagggg ttccggggtt gggagctgaa ccatccccac ctcagggtta tatttccctc 
tccccttccc tccccgccaa gagctctgcc aggggcgggc aaaaaaaaaa gtaaaaagaa 
aagaaaaaaa aaaaaaagaa acaaaccacc tctacatatt atggaaagaa aatatttttg 
tcgattctta ttcttttata attatgcgtg gaagaagtag acacattaaa cgattccagt 
tggaaacatg tcacctg 
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gkctykgtkg wtsmagatac agatgtggtg tataaaagcg agaatggaca tgtcattaaa 60 
ctgaatatag aaacaaatgc taccacatta ttattggaaa acacaacttt tgtaaccttc 120 
aaagcatcaa gacattcagt ttcaccagat ttaaaatatg tccttctggc atatgatgtc 180 
aaacagattt ttcattattc gtatactgct tcatatgtga tttacaacat acacactagg 240 
gaagtttggg agttaaatcc tccagaagta gaggactccg tcttgcagta cgcggcctgg 300 
ggtgtccaag ggcagcagct gatttatatt tttgaaaata atatctacta tcaacctgat 360 
ataaagagca gttcattgcg actgacatct tctggaaaag aagaaataat ttttaatggg 420 
attgctgact ggttatatga agaggaactc ctgcattctc acatcgccca ctggtggtca 480 
ccagatggag aaagacttgc cttcctgatg ataaatgact ctttggtacc caccatggtt 540 
atccctcggt ttactggagc gttgtatccc aaaggaaagc agtatccgta tcctaaggca 600 
ggtcaagtga acccaacaat aaaattatat gttgtaaacc tgtatggacc aactcacact 660 
ttggagctca tgccacctga cagctttaaa tcaagagaat actatatcac tatggttaaa 720 
tgggtaagca ataccaagac tgtggtaaga tggttaaacc gacctcagaa catctccatc 780 
ctcacagtct gtgagaccac tacaggtgct tgtagtaaaa aatatgagat gacatcagat 840 
acgtggctct ctcagcagaa tgaggagccc gtgttttcta gagacggcag caaattcttt 900 
atgacagtgc ctgttaagca agggggacgt ggagaatttc accacatagc tatgttcctc 960 
atccagagta aaagtgagca aattaccgtg cggcatctga catcaggaaa ctgggaagtg 1020 
ataaagatct tggcatacga tgaaactact caaaaaattt actttctgag cactgaatct 1080 
tctcccagag gaaggcagct gtacagtgct tctactgaag gattattgaa tcgccaatgc 1140 
atttcatgta atttcatgaa agaacaatgt acatattttg atgccagttt tagtcccatg 1200 
aatcaacatt tcttattatt ctgtgaaggt ccaagggtcc cagtggtcag cctacatagt 1260 
acggacaacc cagcaaaata ttttatattg gaaagcaatt ctatgctgaa ggaagctatc 1320 
ctgaagaaga agataggaaa gccagaaatt aaaatccttc atattgacga ctatgaactt 1380 
cctttacagt tgtcccttcc caaagatttt atggaccgaa accagtatgc tcttctgtta 1440 
ataatggatg aagaaccagg aggccagctg gttacagata agttccatat tgactgggat 1500 
tccgtactca ttgacatgga taatgtcatt gtagcaagat ttgatggcag aggaagtgga 1560 
ttccagggtc tgaaaatttt gcaggagatt catcgaagat taggttcagt agaagtaaag 1620 
gaccaaataa cagctgtgaa atttttgctg aaactgcctt acattgactc caaaagatta 1680 
agcatttttg gaaagggtta tggtggctat attgcatcaa tgatcttaaa atcagatgaa 1740 
aagcttttta aatgtggatc cgtggttgca cctatcacag acttgaaatt gtatgcctca 1800 
gctttctctg aaagatacct tgggatgcca tctaaggaag aaagcactta ccaggcagcc i860 
agtgtgctac ataatgttca tggcttgaaa gaagaaaata tattaataat tcatggaact 1920 
gctgacacaa aagttcattt ccaacactca gcagaattaa tcaagcacct aataaaagct i960 
ggagtgaatt atactatgca ggtctaccca gatgaaggtc ataacgtatc tgagaagagc 2040 
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aagtatcatc tctacagcac aatcctcaaa ttcttcagtg attgtttgaa ggaagaaata 
tctgtgctac cacaggaacc agaagaagat gaataatgga ccgtatttat acagaactga 
agggaatatt gaggctcaat gaaacctgac aaagagactg taatattgta gttgctccag 
aatgtcaagg gcagcttacg gagatgtcac tggagcagca cgctcagaga cagtgaacta 
gcatttgaat acacaagtcc aagtctactg tgttgctagg ggtgcagaac ccgtttcttt 
gtatgagaga ggtcaaaggg ttggtttcct gggagaaatt agttttgcat taaagtagga 
gtagtgcatg ttttcttctg ttatccccct gtttgttctg taactagttg ctctcatttt 
aatttcactg gccaccatca tctttgcata taatgcacaa tctatcatct gtcctacagt 
ccctgatctt tcatggctga gctgcaatct aacactttac tgtaccttta taataagtgc 
aattctttca ttgtctatta ttatgcttaa gaaaatattc agttaataaa aaacagagta 
ttttatgtaa tttctgtttt taaaaagaca ttattaaatg ggtcaaagga catatagaaa 
tgtggatttc agcaccttcc aaagttcagc cagttatcag tagatacaat atctttaaat 
gaacacacga gtgtatgtct cacaatatat atacacaagt gtgcatatac agttaatgaa 
actatcttta aatgttattc atgctataaa gagtaaacgt ttgatgaatt agaagagatg 
ctcttttcca agctataatg gatgctttgt ttaatgagcc aaatatgatg aaacattttt 
tccaattcaa attctagcta ttgctttcct ataaatgttt gggttgtgtt tggtattgtt 
tttagtggtt aatagttttc cagttgcatt taattttttg aatatgatac cttgtcacat 
gtaaattaga tacttaaata ttaaattata gtttctgata aagaaatttt gttaacaatg 
caatgccact gagtgctatt ttgctctttt ggtggagaag gcttttttca aaactcttgg 
tccttttact tctttctctc agtgcagaat caattctcat tttcatcgta aaagcaaata 
gctggattat cccatttgcc agttcctatfc tagtattcca tgcctgccca attcatctgt 
tactgtttaa tttcaattct tctggtgaga attagaaatg aaatattttt tattcattgg 
ccaaaaagtt cacagacagc agtgtttgct atttactttg aattgaaggc acaaaatgca 
tcaattcctg tgctgtgttg acttgcagta gtaagtaact gagagcataa aataaacctg 
actgtatgaa gtcaatttaa gtgatgagaa catttaactt tggtgactaa agtcagaata 
tcttctcact tcacttaagg gatcttccag aagatatcta aaagtctgta ataagcttag 
aagttcagat aaatctaggc aggatactgc atttttgtgg ttttaaaaaa gtccttagga 
cagactgaat tatcataact tatggcatca ggaggaaact ttaaaatatc aaggaatcac 
tcagtcaccc tcctgttttg ttgaaggatc aaccccaaat tctgggtatt tgagtacatg 
tgaatcatgg atttggtatt caactttttc cctggatgct ttggaatcgt gtcttccatg 
ctccactggg ttcaatttaa aataggagag gctttctctt ctgaaagatc cattttaggt 
ctttttcaag aatagtgaac acatttttta acaaaataag ttgtaatttt aaaaggaaag 
ttttgcctat tttattaaga tggaaatttc tttttaggct aatttgaaat ccaactgaag 
ctttttaacc aatattttaa atttgaacca ctagagtttt ttatgatgca aatgattatg 
ttgtctgaaa ggtgtggttt tattgaatgt ctatttgagt atcatttaaa aagtatttgc 
cttttactgt catcatttct cttgttttat tattattatc aatgtttatc tatttttcaa 
ttaatttaat acagtttcta atgtgaaaga catttttctg gaacccgttt tccccttaaa 
cactaaagag acctcaagtg aaagcatatt gcttagtagg aaggtagaaa atgttaatcc 
ctgcgattct ttgagtttta atgacagggt cattttcagt aaaggaaatg ctcaccaaca 
catagtcacc aactattaaa ggaatcatgt gattggattt tcccctgtat acatgtaccc 
ttggtcataa tcccactatt tcatacatat ttatgcattg ctagattttc ctaggactcc 
aatagcatgc tttccaagtg ttattattcc cttaatgtta a 
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Tyr 

Glu 

Glu 

Glu 




135 





140 





Ser 

His 

lie 

Ala 

His 

Trp 

Trp 

Ser 

Pro 

Asp 

Gly 

Glu 

Arg 



150 





155 





160 

Leu 

Met 

lie 

Asn 

Asp 

Ser 

Leu 

Val 

Pro 

Thr 

Met 

Val 

lie 


165 





170 





175 


Thr 

Gly 

Ala 

Leu 

Tyr 

Pro 

Lys 

Gly 

Lys 

Gin 

Tyr 

Pro 

Tyr 

180 





185 





190 



Gly 

Gin 

Val 

Asn 

Pro 

Thr 

lie 

Lys 

Leu 

Tyr 

Val 

Val 

Asn 





200 





205 




Pro 

Thr 

His 

Thr 

Leu 

Glu 

Leu 

Met 

Pro 

Pro 

Asp 

Ser 

Phe 




215 





220 





Glu 

Tyr 

Tyr 

lie 

Thr 

Met 

val 

Lys 

Trp 

Val 

Ser 

Asn 

Thr 



230 





235 





240 

Val 

Arg 

Trp 

Leu 

Asn 

Arg 

Pro 

Gin 

Asn 

lie 

Ser 

lie 

Leu 


245 





250 





255 


Glu 

Thr 

Thr 

Thr 

Gly 

Ala 

Cys 

Ser 

Lys 

Lys 

Tyr 

Glu 

Met 

260 





265 





270 



Thr 

Trp 

Leu 

Ser 

Gin 

Gin 

Asn 

Glu 

Glu 

Pro 

Val 

Phe 

Ser 





280 





285 




Ser 

Lys 

Phe 

Phe 

Met 

Thr 

val 

Pro 

Val 

Lys 

Gin 

Gly 

Gly 




295 





300 





Phe 

His 

His 

lie 

Ala 

Met 

Phe 

Leu 

He 

Gin 

Ser 

Lys 

Ser 



310 





315 





320 

Thr 

Val 

Arg 

His 

Leu 

Thr 

Ser 

Gly 

Asn 

Trp 

Glu 

Val 

lie 


325 





330 





335 


Ala 

Tyr 

Asp 

Glu 

Thr 

Thr 

Gin 

Lys 

lie 

Ser 

Ala 

Ser 

Thr 

340 





345 





350 



Leu 

Asn 

Arg 

Gin 

Cys 

lie 

Ser 

Cys 

Asn 

Phe 

Met 

Lys 

Glu 





360 





365 




Tyr 

Phe 

Asp 

Ala 

Ser 

Phe 

Ser 

Pro 

Met 

Asn 

Gin 

His 

Phe 




375 





380 





Cys 

Glu 

Gly 

Pro 

Arg 

Val 

Pro 

Val 

Val 

Ser 

Leu 

His 

Ser 



390 





395 





400 

Pro 

Ala 

Lys 

Tyr 

Phe 

lie 

Leu 

Glu 

Ser 

Asn 

Ser 

Met 

Leu 


405 





410 





415 


lie 

Leu 

Lys 

Lys 

Lys 

lie 

Gly 

Lys 

Pro 

Glu 

lie 

Lys 

lie 

420 





425 





430 



Asp 

Asp 

Tyr 

Glu 

Leu 

Pro 

Leu 

Gin 

Leu 

Ser 

Leu 

Pro 

Lys 





440 





445 




Asp 

Arg 

Asn 

Gin 

Tyr 

Ala 

Leu 

Leu 

Leu 

lie 

Met 

Asp 

Glu 




455 





460 





Gly 

Gin 

Leu 

Val 

Thr 

Asp 

Lys 

Phe 

His 

lie 

Asp 

Trp 

Asp 



470 





475 





480 

lie 

Asp 

Met 

Asp 

Asn 

Val 

lie 

Val 

Ala 

Arg 

Phe 

Asp 

Gly 


485 





490 





495 


Gly 

Phe 

Gin 

Gly 

Leu 

Lys 

lie 

Leu 

Gin 

Glu 

lie 

His 

Arg 

500 





505 





510 



Ser 

Val 

Glu 

Val 

Lys 

Asp 

Gin 

lie 

Thr 

Ala 

Val 

Lys 

Phe 





520 





525 




Leu 

Pro 

Tyr 

lie 

Asp 

Ser 

Lys 

Arg 

Leu 

Ser 

lie 

Phe 

Gly 




535 





540 
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Lys 

Gly 

Tyr 

Gly 

Gly 

Tyr 

lie 

Ala 

Ser 

Met 

lie 

Leu 

Lys 

Ser 

Asp 

Glu 

545 





550 





555 





560 

Lys 

Leu 

Phe 

Lys 

Cys 

Gly 

Ser 

Val 

Val 

Ala 

Pro 

lie 

Thr 

Asp 

Leu 

Lys 





565 





570 





575 


Leu 

Tyr 

Ala 

Ser 

Ala 

Phe 

Ser 

Glu 

Arg 

Tyr 

Leu 

Gly 

Met 

Pro 

Ser 

Lys 




580 





585 





590 



Glu 

Glu 

Ser 

Thr 

Tyr 

Gin 

Ala 

Ala 

Ser 

Val 

Leu 

His 

Asn 

Val 

His 

Gly 



595 





600 





60S 




Leu 

Lys 

Glu 

Glu 

Asn 

lie 

Leu 

He 

lie 

His 

Gly 

Thr 

Ala 

Asp 

Thr 

Lys 


610 





615 





620 





Val 

His 

Phe 

Gin 

His 

Ser 

Ala 

Glu 

Leu 

lie 

Lys 

His 

Leu 

lie 

Lys 

Ala 

625 





630 





635 





640 

Gly 

Val 

Asn 

Tyr 

Thr 

Met 

Gin 

Val 

Tyr 

Pro 

Asp 

Glu 

Gly 

His 

Asn 

Val 





645 





650 





655 


Ser 

Glu 

Lys 

Ser 

Lys 

Tyr 

His 

Leu 

Tyr 

Ser 

Thr 

lie 

Leu 

Lys 

Phe 

Phe 




660 





665 





670 



Ser 

Asp 

Cys 

Leu 

Lys 

Glu 

Glu 

lie 

Ser 

val 

Leu 

Pro 

Gin 

Glu 

Pro 

Glu 



675 





680 





685 




Glu 

Asp 

Glu 















690 
















<210> 44 

<211> 4496 

<212> DNA 

<213 > Homo sapiens 

<400> 44 

gkctykgtkg wtsmagatac agatgtggtg tataaaagcg agaatggaca tgtcattaaa 60 

ctgaatatag aaacaaatgc taccacatta ttattggaaa acacaacttt tgtaaccttc 120 

aaagcatcaa gacattcagt ttcaccagat ttaaaatatg tccttctggc atatgatgtc 180 

aaacagattt ttcattattc gtatactgct tcatatgtga tttacaacat acacactagg 240 

gaagtttggg agttaaatcc tccagaagta gaggactccg tcttgcagta cgcggcctgg 300 

ggtgtccaag ggcagcagct gatttatatt tttgaaaata atatctacta tcaacctgat 360 

ataaagagca gttcattgcg actgacatct tctggaaaag aagaaataat ttttaatggg 420 

attgctgact ggttatatga agaggaactc ctgcattctc acatcgccca ctggtggtca 480 

ccagatggag aaagacttgc cttcctgatg ataaatgact ctttggtacc caccatggtt 540 

atccctcggt ttactggagc gttgtatccc aaaggaaagc agtatccgta tcctaaggca 600 

ggtcaagtga acccaacaat aaaattatat gttgtaaacc tgtatggacc aactcacact 660 

ttggagctca tgccacctga cagctttaaa tcaagagaat actatatcac tatggttaaa 720 

tgggtaagca ataccaagac tgtggtaaga tggttaaacc gacctcagaa catctccatc 780 

ctcacagtct gtgagaccac tacaggtgct tgtagtaaaa aatatgagat gacatcagat 840 

acgtggctct ctcagcagaa tgaggagccc gtgttttcta gagacggcag caaattcttt 900 

atgacagtgc ctgttaagca agggggacgt ggagaatttc accacatagc tatgttcctc 960 

atccagagta aaagtgagca aattaccgtg cggcatctga catcaggaaa ctgggaagtg 1020 

ataaagatct tggcatacga tgaaactact caaaaaatca gtgcttctac tgaaggatta 1080 

ttgaatcgcc aatgcatttc atgtaatttc atgaaagaac aatgtacata ttttgatgcc 1140 

agttttagtc ccatgaatca acatttctta ttattctgtg aaggtccaag ggtcccagtg 1200 

gtcagcctac atagtacgga caacccagca aaatatttta tattggaaag caattctatg 1260 

ctgaaggaag ctatcctgaa gaagaagata ggaaagccag aaattaaaat ccttcatatt 1320 

gacgactatg aacttccttt acagttgtcc cttcccaaag attttatgga ccgaaaccag 1380 

tatgctcttc tgttaataat ggatgaagaa ccaggaggcc agctggttac agataagttc 1440 

catattgact gggattccgt actcattgac atggataatg tcattgtagc aagatttgat 1500 

ggcagaggaa gtggattcca gggtctgaaa attttgcagg agattcatcg aagattaggt 1560 

tcagtagaag taaaggacca aataacagct gtgaaatttt tgctgaaact gccttacatt 1620 

gactccaaaa gattaagcat ttttggaaag ggttatggtg gctatattgc atcaatgatc 1680 

ttaaaatcag atgaaaagct ttttaaatgt ggatccgtgg ttgcacctat cacagacttg 1740 

aaattgtatg cctcagcttt ctctgaaaga taccttggga tgccatctaa ggaagaaagc 1800 

acttaccagg cagccagtgt gctacataat gttcatggct tgaaagaaga aaatatatta 1860 

ataattcatg gaactgctga cacaaaagtt catttccaac actcagcaga attaatcaag 1920 
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cacctaataa aagctggagt 
gtatctgaga agagcaagta 
ttgaaggaag aaatatctgt 
tttatacaga actgaaggga 
ttgtagttgc tccagaatgt 
agagacagtg aactagcatt 
agaacccgtt tctttgtatg 
tgcattaaag taggagtagt 
agttgctctc attttaattt 
catctgtcct acagtccctg 
ctttataata agtgcaattc 
ataaaaaaca gagtatttta 
aaggacatat agaaatgtgg 
acaatatctt taaatgaaca 
tatacagtta atgaaactat 
gaattagaag agatgctctt 
tgatgaaaca ttttttccaa 
gtgtttggta ttgtttttag 
gataccttgt cacatgtaaa 
attttgttaa caatgcaatg 
tttcaaaact cttggtcctt 
tcgtaaaagc aaatagctgg 
gcccaattca tctgttactg 
ttttttattc attggccaaa 
aaggcacaaa atgcatcaat 
cataaaataa acctgactgt 
actaaagtca gaatatcttc 
ctgtaataag cttagaagtt 
aaaaagtcct taggacagac 
atatcaagga atcactcagt 
gtatttgagt acatgtgaat 
atcgtgtctt ccatgctcca 
agatccattt taggtctttt 
attttaaaag gaaagttttg 
gaaatccaac tgaagctttt 
atgcaaatga ttatgttgtc 
ttaaaaagta tttgcctttt 
ttatctattt ttcaattaat 
cgttttcccc ttaaacacta 
agaaaatgtt aatccctgcg 
aaatgctcac caacacatag 
tgtatacatg tacccttggt 
ttttcctagg actccaatag 

<210> 45 

<211> 29 

<212> DNA 

<213> Homo sapiens 

<400> 45 

cggtaccatg gcagcagcaa 

<210> 46 

<211> 39 

<212> DNA 

<213> Homo sapiens 

<4 00> 46 

ggagctcgcg gccgctcata 


gaattatact atgcaggtct 
tcatctctac agcacaatcc 
gctaccacag gaaccagaag 
atattgaggc tcaatgaaac 
caagggcagc ttacggagat 
tgaatacaca agtccaagtc 
agagaggtca aagggttggt 
gcatgttttc ttctgttatc 
cactggccac catcatcttt 
atctttcatg gctgagctgc 
tttcattgtc tattattatg 
tgtaatttct gtttttaaaa 
atttcagcac cttccaaagt 
cacgagtgta tgtctcacaa 
ctttaaatgt tattcatgct 
ttccaagcta taatggatgc 
ttcaaattct agctattgct 
tggttaatag ttttccagtt 
ttagatactt aaatattaaa 
ccactgagtg ctattttgct 
ttacttcttt ctctcagtgc 
attatttcat ttgccagttt 
tttaatttca attcttctgg 
aagttcacag acagcagtgt 
tcctgtgctg tgttgacttg 
atgaagtcaa tttaagtgat 
tcacttcact taagggatct 
cagataaatc taggcaggat 
tgaattatca taacttatgg 
caccctcctg ttttgttgaa 
catggatttg gtattcaact 
ct 999 ttcaa tttaaaatag 
tcaagaatag tgaacacatt 
cctattttat taagatggaa 
taaccaatat tttaaatttg 
tgaaaggtgt ggttttattg 
actgtcatca tttctcttgt 
ttaatacagt ttctaatgtg 
aagagacctc aagtgaaagc 
attctttgag ttttaatgac 
tcaccaacta ttaaaggaat 
cataatccca ctatttcata 

t 

catgctttcc aagtgttatt 


tggaaacag 


tcacttttag agcagcaat 


acccagatga aggtcataac 
tcaaattctt cagtgattgt 
aagatgaata atggaccgta 
ctgacaaaga gactgtaata 
gtcactggag cagcacgctc 
tactgtgttg ctaggggtgc 
ttcctgggag aaattagttt 
cccctgtttg ttctgtaact 
gcatataatg cacaatctat 
aatctaacac tttactgtac 
cttaagaaaa tattcagtta 
agacattatt aaatgggtca 
tcagccagtt atcagtagat 
tatatataca caagtgtgca 
ataaagagta aacgtttgat 
tttgtttaat gagccaaata 
ttcctataaa tgtttgggtt 
gcatttaatt ttttgaatat 
ttatagtttc tgataaagaa 
cttttggtgg agaaggcttt 
agaatcaatt ctcattttca 
ctatttagta ttccatgcct 
tgagaattag aaatgaaata 
ttgctattta ctttgaattg 
cagtagtaag taactgagag 
gagaacattt aactttggtg 
tccagaagat atctaaaagt 
actgcatttt tgtggtttta 
catcaggagg aaactttaaa 
ggatcaaccc caaattctgg 
ttttccctgg atgctttgga 
gagaggcttt ctcttctgaa 
ttttaacaaa ataagttgta 
atttcttttt aggctaattt 
aaccactaga gttttttatg 
aatgtctatt tgagtatcat 
tttattatta ttatcaatgt 
aaagacattt ttctggaacc 
atattgctta gtaggaaggt 
agggtcattt tcagtaaagg 
catgtgattg gattttcccc 
catatttatg cattgctaga 
attcccttaa tgttaa 
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2940 
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3360 

3420 

3480 
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3660 

3720 

3780 

3840 
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4020 

4080 

4140 

4200 
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4320 
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<210> 47 

<211> 27 

<212> DNA 
<213 > Homo sapiens 
<400> 47 

caagctttat cacttttaga gcagcaa 

<210 > 48 

< 211 > 22 
<212 > DNA 
<213 > Homo sapiens 
<4 00> 48 

cacattcttg ctgcatcagt ca 

<210> 49 

< 211 > 22 

<212> DNA 

< 213 > Homo sapiens 


ttgggtcatc ttcaggactt ga 

<210> 50 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 50 


caagcttacc atggccacca ccgggac 

<210> 51 

<211> 37 

<212> DNA 

<213> Homo sapiens 

<400> 51 

cggatccgcg gccgctcaga ggtattcctg tagaaag 

<210> 52 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 52 

cggatccagg tattcctgta gaaagtg 

<210> 53 

< 211 > 20 
<212> DNA 
<213> Homo sapiens 
<400> 53 

tacgccgtgg ttgtgattga 

<210> 54 

< 211 > 20 
<212> DNA 
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22 


27 


37 


27 
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<213> Homo sapiens 
<400> 54 


ccatacttct cggccacgaa 

<210> 55 

<211> 19 

<212> DMA 

<213 > Homo sapiens 

<400> 55 


gcctgggatt gtgcactgt 

<210> 56 

<211> 29 

<212> DNA 

<213> Homo sapiens 

<400> 56 

gtgtattcaa atgctagttc actgtctct 

<210> 57 

< 211 > 22 

<212> DMA 

<213> Homo sapiens 

<400> 57 

agctagcact gtccagggtc ct 

<210> 58 

<211> 25 

<212> DMA 

<213> Homo sapiens 

<400> 58 


agggcccttc atcttcttct ggttc 


<210> 59 

<211> 19 

<212> PRT 

<213> Homo sapiens 

<400> 59 


Val Glu Asp Asp Val Met Glu Arg Gin Arg Leu lie Glu Ser Val Pro 
15 10 15 

Asp Ser Val 

<210> 60 

<211> 19 

<212> PRT 

<213> Homo sapiens 

<400> 60 

Ser Thr Glu Asn Glu Glu Gin Arg Leu Ala Ser Ala Arg Ala Val Pro 
15 10 15 

Arg Asn Val 

<210> 61 
<211> 15 
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<212 > PRT 

<213> Homo sapiens 

<400> 61 

Lys Glu Ala lie Leu Lys Lys Lys lie Gly Lys Pro Glu lie Lys 
1 5 10 15 
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